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OM protein - protein search, using sw model 
Run on: March 26, 2004, 15:30:32 ; 



Search time 50.5 Seconds 

(without alignments) 

44.760 Million cell updates/sec 



Title: US-09-668-314C-71 

Perfect score: 8 

Sequence: 1 EVNLDAEF 8 

Scoring table: OLIGO 

Gapop 60.0 , Gapext 60.0 



Searched 



1586107 seqs, 282547505 residues 



Word size : 



0 



Total number of hits satisfying chosen parameters 

Minimum DB seq length: 0 

Maximum DB seq length: 2000000000 

Post-processing: Listing first 1000 summaries 



1586107 



Database : 



A_Geneseq_2 9 Jan04 : * 

1: geneseqpl980s : * 

2: geneseqpl990s : * 

3: geneseqp2000s : * 

4: geneseqp2001s : * 

5: geneseqp2 002s : * 

6: geneseqp2003as : * 

7: geneseqp2003bs : * 

8: geneseqp2004s : * 

Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 
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993 


5 


62 . 


5 


763 


6 


ABU20715 


Abu20715 


Protein e 


994 


5 


62. 


5 


783 


5 


ABP35616 


Abp35616 


Fungal ZB 


995 


5 


62. 


5 


784 


4 


ABG30180 


Abg30180 


Novel hum 


996 


5 


62. 


5 


786 


4 


ABG09526 


Abg09526 


Novel hum 


997 


5 


62. 


5 


788 


6 


ABU45872 


Abu45872 


Protein e 


998 


5 


62 . 


5 


796 


5 


ABP73364 


Abp73364 


Candida a 


999 


5 


62. 


5 


815 


6 


ABU38957 


Abu38957 


Protein e 


1000 


5 


62. 


5 


819 


6 


ABU49933 


Abu49933 


Protein e 



ALIGNMENTS 



RESULT 1 
AAY94771 

ID AAY94771 standard; protein; 8 AA. 
XX 

AC AAY94771; 
XX 

DT 12-FEB-2001 (first entry) 
XX 

DE Beta-secretase substrate peptide SEQ ID 17. 
XX 

KW Beta-secretase; enzyme; amyloid plaque; Alzheimer's disease; 

KW Down's syndrome; amyloid angiopathy; gene therapy; neuroprotective. 

XX 

OS Synthetic. 
XX 

PN W02 00058 4 7 9-Al . 
XX 

PD 05-OCT-2000. 
XX 

PF 23-MAR-2000; 2 00 0WO-US 0077 55 . 
XX 

PR 26-MAR-1999; 99US-00277229 . 
XX 

PA (AMGE-) AMGEN INC. 
XX 

PI Citron M, Vassar RJ, Bennett BD; 
XX 

DR WPI; 2000-594643/56. 
XX 

PT Isolated beta-secretase nucleic acids and encoded polypeptides, useful 



PT for diagnosis and gene therapy of Alzheimer's disease. 
XX 

PS Example 10; Page 117; 145pp; English. 
XX 

CC This invention relates to 3 nucleotide sequences encoding beta-secretase 

CC proteins. Beta-secretase is an enzyme involved in the production of one 

CC of the components of amyloid plaques involved in Alzheimer's disease. The 

CC invention includes an expression vector comprising the nucleotide 

CC sequence, a host cell comprising the expression vector, and a process for 

CC producing the protein through culturing the transformed cells. Also 

CC included in the invention are a polypeptide derivative of the beta- 

CC secretase protein, a fusion protein comprising beta-secretase fused to a 

CC heterologous amino acid sequence, and a method for modulating the levels 

CC of beta-secretase polypeptide in a mammal comprising administering the 

CC polynucleotide sequence. Beta-secretase exhibits neuroprotective and 

CC nootropic activity. The beta-secretase nucleotide sequence may be used to 

CC map locations of the beta-secretase gene and related genes on chromosomes 

CC and as hybridization probes in diagnostic assays to test for the presence 

CC of beta-secretase DNA or RNA, such as in Alzheimer's disease, Down's 

CC syndrome, and amyloid angiopathy. The nucleotide sequence may also be 

CC used as anti-sense inhibitors of beta-secretase expression, in gene 

CC therapy of Alzheimer's disease, and for the identification of compounds 

CC that modulate beta-secretase activity. Antibodies to the beta-secretase 

CC protein may be used for in vitro and in vivo diagnostic purposes to 

CC detect the presence of beta-secretase polypeptide in a body fluid or cell 

CC sample. The present sequence represents a beta-secretase substrate 

CC peptide 

XX 

SQ Sequence 8 AA; 



Query Match 100.0%; Score 8; DB 3; Length 8; 

Best Local Similarity 100.0%; Pred. No. 1.4e+06; 

Matches 8; Conservative 0; Mismatches 0; Indels 0; Gaps 0 



Qy 1 EVNLDAEF 8 

1 I I I II I I 
Db 1 EVNLDAEF 8 



RESULT 2 
AAE10661 

ID AAE10661 standard; peptide; 8 AA. 
XX 

AC AAE10661; 
XX 

DT 10-DEC-2001 (first entry) 
XX 

DE Human aspartyl protease-1 beta-secretase Swedish mutant peptide. 
XX 

KW Human; aspartyl protease 1; Aspl; amyloid precursor protein; APP; 

KW Alzheimer's disease; AD ; dementia; neurofibrillary tangle; gliosis; 

KW amyloid plaque; neuronal loss; proteolytic; nootropic; neuroprotective; 

KW aspartyl protease-1 beta-secretase Swedish mutant peptide. 

XX 

OS Homo sapiens . 

OS Synthetic. 
XX 



FH 
FT 
XX 
PN 
XX 
PD 
XX 
PF 
XX 
PR 
PR 
PR 
PR 
PR 
XX 
PA 
XX 
PI 
XX 
DR 
XX 
PT 
PT 
PT 
XX 
PS 
XX 

cc 
cc 
cc 
cc 
cc 
cc 
cc 
cc 
cc 
cc 
cc 
cc 
cc 
cc 
cc 
cc 

XX 
SQ 



Key Location/Qualifiers 
Cleavage-site 4 . .5 

GB2357767-A. 

04- JUL-2001 . 

22-SEP-2 000; 2 0 00GB- 00 02 3315 . 



23-SEP-1999 
23-SEP-1999 
23-SEP-1999 
13-OCT-1999 
06-DEC-1999 



99US-00404133 
99US-0155493P 
99WO-US020881 
99US-00416901 
99US-0169232P 



(PHAA ) PHARMACIA & UPJOHN CO. 
Bienkowkski MJ, Gurney M; 
WPI; 2001-444208/48. 

Polypeptide comprising fragments of human aspartyl protease with amyloid 
precursor protein processing activity and alpha-secretase activity, for 
identifying modulators useful in treating Alzheimer's disease. 

Example 15 ; Page 92 ; 187pp ; English . 

The patent discloses human aspartyl protease 1 (hu-Aspl) or modified Aspl 
proteins which lack transmembrane domain or amino terminal domain or 
cytoplasmic domain and retains alpha-secretase activity and amyloid 
protein precursor (APP) processing activity. The proteins of the 
invention are useful for assaying hu-Aspl alpha-secretase activity, which 
in turn is useful for identifying modulators of hu-Aspl alpha-secretase 
activity, where modulators that increase hu-Aspl alpha-secretase activity 
are useful for treating Alzheimer's disease (AD) which causes progressive 
dementia with consequent formation of amyloid plaques, neurofibrillary 
tangles, gliosis and neuronal loss. Hu-Aspl protease substrate is useful 
for assaying hu-Aspl proteolytic activity, by contacting hu-Aspl protein 
with the substrate under acidic conditions and determining the level of 
hu-Aspl proteolytic activity. The present sequence is human aspartyl 
protease-1 (hu-Asp-1) beta-secretase Swedish (Sw) mutant peptide which is 
used for determining the enzymatic activity of Asp-1 protein lacking a 
transmembrane (TM) domain and containing (His) 6 tag 

Sequence 8 AA; 



Query Match 100.0%; Score 8; DB 4 ; Length 8; 

Best Local Similarity 100.0%; Pred. No. 1.4e+06; 

Matches 8; Conservative 0; Mismatches 0; Indels 



0; Gaps 



0; 



Qy 

Db 



1 EVNLDAEF 8 

I I I I I I I I 
1 EVNLDAEF 8 



RESULT 3 
AAE02613 



ID AAE02613 standard; peptide; 8 AA. 
XX 

AC AAE02613; 
XX 

DT 10-AUG-2001 (first entry) 
XX 

DE Human Aspartyl protease-1 beta-secretase Swedish mutant form peptide. 
XX 

KW Human; alpha-secretase ; amyloid precursor protein; APP; therapy; 
KW Alzheimer's disease; antial zheimer 1 s ; aspartyl protease 1; Aspl; 
KW beta-secretase. 
XX 

OS Homo sapiens. 
XX 

FH Key Location/Qualifiers 

FT Cleavage-site 4. .5 

XX 

PN WO200123533-A2 . 
XX 



PD 


05 


-APR- 


2001. 






XX 












PF 


22 


-SEP- 


2000; 


2 00 0WO- 


US026080. 


XX 












PR 


23 


-SEP- 


1999; 


99US- 


0155493P. 


PR 


23 


-SEP- 


1999; 


9 9WO- 


US020881 . 


PR 


13 


-OCT- 


1999; 


99US- 


00416901 . 


PR 


06 


-DEC- 


1999; 


99US- 


0169232P. 



XX 

PA ( PHAA ) PHARMACIA & UPJOHN CO. 
XX 

PI Gurney M, Bienkowski MJ; 
XX 

DR WPI; 2001-290516/30. 
XX 

PT Enzymes that cleave the alpha-secretase site of the amyloid precursor 

PT protein, useful for the treatment of Alzheimer's disease. 

XX 

PS Example 15; Page 94; 189pp; English. 
XX 

CC The present invention relates to enzymes for cleaving the alpha- 

CC secretase site of the amyloid precursor protein (APP) and methods of 

CC identifying those enzymes. The methods may be used to identify enzymes 

CC that may be used to cleave the alpha-secretase cleavage site of the APP 

CC protein. The enzymes may be used to treat or modulate the progress of 

CC Alzheimer's disease. The present sequence is human Aspartyl protease-1 

CC (hu-Asp-1) beta-secretase, Swedish (Sw) mutant form peptide which is us 

CC for determining the enzymatic activity of Asp-1 deltaTM (His) 6 protein 
XX 

SQ Sequence 8 AA; 

Query Match 100.0%; Score 8; DB 4; Length 8; 

Best Local Similarity 100.0%; Pred. No. 1.4e+06; 

Matches 8; Conservative 0; Mismatches 0; Indels 0; Gaps 

Qy 1 EVNLDAEF 8 

I I ! I 1 I II 

Db 1 EVNLDAEF 8 



RESULT 4 
ABB78622 

ID ABB78622 standard; peptide; 8 AA. 
XX 

AC ABB7 8 622; 
XX 

DT 16-JUL-2002 (first entry) 
XX 

DE Human beta secretase peptide SEQ ID NO : 7 1 . 
XX 

KW Human; Asp-1; Asp-2; aspartyl protease; Alzheimer's disease; proteolytic. 
XX 

OS Homo sapiens . 
XX 

PN GB2367060-A. 
XX 

PD 27-MAR-2002 . 
XX 

PF 29-OCT-2001; 2 00 1GB-0002 5 934 . 
XX 

PR 23-SEP-1999; 99US-004 04 133 . 

PR 23-SEP-1999; 99US- 01554 93P . 

PR 23-SEP-1999; 99WO-US02 0 8 8 1 . 

PR 13-OCT-1999; 99US-004 16901 . 

PR 06-DEC-1999; 99US- 01692 32P . 

PR 22-SEP-2000; 2 000GB- 00023315 . 
XX 

PA (PHAA ) PHARMACIA & UPJOHN CO. 
XX 

PI Bienkowkski MJ, Gurney M; 
XX 

DR WPI; 2002-397167/43. 
XX 

PT Human aspartyl protease 1 substrates useful in assays to detect aspartyl 

PT protease activity, e.g. for the diagnosis of Alzheimer's disease. 

XX 

PS Example 15; Page 92; 182pp; English. 
XX 

CC The present invention describes a human aspartyl protease 1 (hu-Aspl) 

CC substrate (I) which comprises a peptide of no more than 50 amino acids, 

CC and which comprises the 8 amino acid sequence Gly-Leu-Ala-Leu-Ala-Leu- 

CC Glu-Pro. Also described are: (1) a method (II) for assaying hu-Aspl 

CC proteolytic activity, comprising: (a) contacting a hu-Aspl protein with 

CC (I) under acidic conditions; and (b) determining the level of hu-Aspl 

CC proteolytic activity; (2) a purified polynucleotide (III) comprising a 

CC nucleotide sequence that hybridises under stringent conditions to the non 

CC -coding strand complementary to a defined 1804 nucleotide sequence (see 

CC ABL52456) where the nucleotide sequence encodes a polypeptide having Aspl 

CC proteolytic activity and lacks nucleotides encoding a transmembrane 

CC domain); (3) a purified polynucleotide (III 1 ) comprising a sequence that 

CC hybridises under stringent conditions to (III) (the nucleotide sequence 

CC encodes a polypeptide further lacking a pro-peptide domain corresponding 

CC to amino acids 23-62 of hu-Aspl (see ABB78589)); (4) a vector (IV) 

CC comprising (III) or (III 1 ); and (5) a host cell (V) transformed or 

CC transfected with (III), (III 1 ) and/or (IV). The hu-Aspl protease 



CC substrate (I) may be used as an enzyme substrate in assays to detect 

CC aspartyl protease activity, (II) and therefore diagnose diseases 

CC associated with aberrant hu-Aspl expression and activity such as 

CC Alzheimer's disease. Hu-Aspl has been localised to chromosome 21, while 

CC hu-Asp2 has been localised to chromosome 1 lq2 3 . 3-2 4 . 1 . The present 

CC sequence represents a human beta secretase peptide, which is used in an 

CC example from the present invention 

XX 

SQ Sequence 8 AA; 

Query Match 100.0%; Score 8; DB 5; Length 8; 

Best Local Similarity 100.0%; Pred. No. 1.4e+06; 

Matches 8; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 

Qy 1 EVNLDAEF 8 

MINIM 

Db 1 EVNLDAEF 8 



RESULT 5 
ABR54159 

ID ABR54159 standard; peptide; 8 AA. 
XX 

AC ABR54159; 
XX 

DT 19-JUN-2003 (first entry) 
XX 

DE Beta-secretase substrate peptide #257. 
XX 

KW Beta-secretase; nootropic; neuroprotective; Secretase Inhibitor Beta; 

KW substrate; beta-APP; amyloid precursor protein; Alzheimer's disease; 

KW amyloid beta peptide deposition; Down's syndrome. 
XX 

OS Synthetic . 
XX 

PN WO200294985-A2 . 
XX 

PD 28-NOV-2002 . 
XX 

PF 17-MAY-2002; 2002WO-US015590 . 
XX 

PR 22-MAY-2001; 2001US-0292591P . 

PR 30-AUG-2001; 2001US-0316115P . 
XX 

PA (MERI ) MERCK & CO INC. 
XX 

PI Bruce JE, Tang X, Garsky V, Brady SF, Li Y, Chen-Dodson E; 

PI Sardana M, Shafer JA; 

XX 

DR WPI; 2003-148462/14. 
XX 

PT New beta-secretase substrate peptides, useful as substrates in assays for 

PT measuring beta-secretase activity, or for screening test compounds for 

PT the ability to inhibit beta-secretase activity used for treating 

PT Alzheimer's disease. 
XX 

PS Example 2; Fig 1; 103pp; English. 



XX 

CC The invention relates to a novel substantially pure beta -seer etas e 

CC substrate peptide. A peptide of the invention has nootropic, and 

CC neuroprotective activity . The peptide acts as a Secretase Inhibitor Beta . 

CC The beta- secretase substrate peptides are useful as substrates in various 

CC assays for measuring beta-secretase activity. The methods are useful for 

CC screening test compounds for the ability to inhibit beta-secretase 

CC activity, or inhibiting the cleavage of beta-APP (amyloid precursor 

CC protein) in mammalian hosts, which can be used for treating conditions 

CC related to amyloid beta peptide deposition, e.g. Alzheimer's disease or 

CC Down's syndrome. The present sequence represents a beta secretase 

CC substrate peptide of the invention 

XX 

SQ Sequence 8 AA; 

Query Match 100.0%; Score 8; DB 6; Length 8; 

Best Local Similarity 100.0%; Pred. No. 1.4e+06; 

Matches 8; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 

Qy 1 EVNLDAEF 8 

I I I I I II I 
Db 1 EVNLDAEF 8 



RESULT 6 
AAW82081 

ID AAW82081 standard; peptide; 9 AA. 
XX 

AC AAW82081; 
XX 

DT 18-FEB-1999 (first entry) 
XX 

DE Fluor o genie protease indicator protease binding peptide #59 . 
XX 

KW Protease activity; f luorphore ; detection ; f luorogenic ; cellular uptake ; 

KW conformation change . 

XX 

OS Synthetic. 
XX 

PN W09837226-A1. 
XX 

PD 27-AUG-1998 . 
XX 

PF 2 0-FEB-1998 ; 9 8WO-US00300 0 . 
XX 

PR 2 0-FEB-19 97; 97US-008 02 98 1 . 
XX 

PA (ONCO-) ONCOIMMUNIN INC. 
XX 

PI Komoriya A, Packard BS ; 
XX 

DR WPI ; 1998-4 67 57 9/40. 
XX 

PT New f luorogenic compositions - containing 2 f luorophores separated by a 
PT peptide comprising a protease binding site, used for detecting protease 
PT activity in samples. 
XX 



PS Claim 4; Page 77; 90pp; English. 
XX 

CC AAW82023-W82240 are peptides used in the construction of a fluorogenic 

CC composition which is used for the detection of protease activity in 

CC biological samples . The products can be used for the detection of 

CC conformation changes in nucleic acids, oligosaccharides, polysaccharides, 

CC proteins, peptides, lipids, phopholipids , glycolipids, glycoproteins, 

CC steroids or polymers . In addition, attachment of a hydrophobic group to a 

CC molecule can be used to enhance uptake by cells. The composition is 

CC composed of P = peptide comprising a protease binding site for the 

CC protease, Fl, F2 peptides = fluorophores where Fl is attached to the 

CC amino terminal amino acid and F2 is attached to the carboxyl terminal 

CC amino acid and SI, S2 peptides = when present, are peptide spacers where 

CC Si, when present, is attached to the amino terminal acid, and S2, when 

CC present, is attached to the carboxyl terminal amino acid 

XX 

SQ Sequence 9 AA; 

Query Match 100.0%; Score 8; DB 2; Length 9; 

Best Local Similarity 100.0%; Pred. No. 1.4e+06; 

Matches 8; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 

Qy 1 EVNLDAEF 8 

I I I I I I I I 
Db 2 EVNLDAEF 9 



RESULT 7 




AAB07874 




ID 


AAB07874 standard; peptide; 9 AA. 


XX 






AC 


AAB07874 ; 




XX 






DT 


14-NOV-2 000 


(first entry) 


XX 






DE 


A peptide fragment derived from beta-amyloid precursor protein. 


XX 






KW 


Beta-secretase ; beta-amyloid precursor protein; beta-amyloid peptide; 


KW 


amyloid plaque component; Alzheimer's disease; amyloidogenic disease; 


KW 


inhibitor . 




XX 






OS 


Homo sapiens 


• 


XX 






PN 


WO200047618- 


A2 . 


XX 






PD 


17-AUG-2000. 




XX 






PF 


10-FEB-2000; 


2000WO-US003819 . 


XX 






PR 


10-FEB-1999; 


99US-0119571P. 


PR 


15- JUN-1999; 


99US-0139172P. 


XX 






PA 


(ELAN-) ELAN 


PHARM INC. 


XX 






PI 


Anderson JP, 


Basi G, Doane MT, Frigon N, John V, Power M; 


PI 


Sinha S, Tatsuno G, Tung J, Wang S, Mcconlogue L; 


XX 







DR 
XX 
PT 
PT 
PT 
XX 
PS 
XX 

cc 
cc 
cc 
cc 
cc 
cc 
cc 
cc 
cc 
cc 

XX 
SQ 



WPI; 2 0 00-533011/4 8 . 

Purified beta-secretase protein used in assays to discover inhibitors 
which can be used for the treatment of amyloidogenic diseases e.g. 
Alzheimer's disease. 

Disclosure; Page 12; 121pp; English. 

The specification describes a beta-secretase enzyme. The enzyme cleaves 
beta-amyloid precursor protein to produce beta-amyloid peptide. This 
enzyme is therefore implicated in the production of amyloid plaque 
components which accumulate in the brains of individuals afflicted with 
Alzheimer's disease. Inhibitors of beta-secretase are administered to a 
mammalian subject e.g. with Alzheimer's disease or Alzheimer's disease- 
like pathology to test if they maintain or improve cognitive ability or 
reduce the plaque burden. The compounds are used for the treatment of 
amyloidogenic diseases e.g. Alzheimer's disease. The present sequence 
represents a peptide derived from beta-amyloid precursor protein 

Sequence 9 AA; 



Query Match 100.0%; Score 8; DB 3; Length 9; 

Best Local Similarity 100.0%; Pred. No. 1.4e+06; 

Matches 8; Conservative 0; Mismatches 0; Indels 



0; Gaps 



Qy 


1 EVNLDAEF 8 




Db 


1 1 1 I ! I 1 1 
2 EVNLDAEF 9 




RESULT 8 




AAB07894 




ID 


AAB07894 standard; peptide; 9 AA. 




XX 






AC 


AAB07894 ; 




XX 






DT 


14-NOV-2000 (first entry) 




XX 






DE 


Substrate for beta-secretase enzyme. 




XX 






KW 


Beta-secretase; beta-amyloid precursor protein; 


beta- amyloid 


KW 


amyloid plaque component; Alzheimer's disease; 


amyloidogenic 


KW 


inhibitor . 




XX 






OS 


Synthetic . 




XX 






FH 


Key Location/Qualifiers 




FT 


Cleavage-site 5 . .6 




XX 






PN 


WO200047618-A2. 




XX 






PD 


17-AUG-2000. 




XX 






PF 


10-FEB-2 000; 2 0 00WO-US 0038 1 9 . 




XX 






PR 


10-FEB-19 99; 99US-01 1957 IP . 




PR 


15-JUN-19 99; 99US-0139 172P . 





XX 

PA (ELAN-) ELAN PHARM INC. 
XX 

PI Anderson JP, Basi G, Doane MT, Frigon N, John V, Power M; 

PI Sinha S, Tatsuno G, Tung J, Wang S, Mcconlogue L; 

XX 

DR WPI; 2000-533011/48. 
XX 

PT Purified beta-secretase protein used in assays to discover inhibitors 

PT which can be used for the treatment of amyloidogenic diseases e.g. 

PT Alzheimer's disease. 
XX 

PS Example 4; Page 71; 121pp; English. 
XX 

CC The specification describes a beta-secretase enzyme. The enzyme cleaves 

CC beta-amyloid precursor protein to produce beta-amyloid peptide. This 

CC enzyme is therefore implicated in the production of amyloid plaque 

CC components which accumulate in the brains of individuals afflicted with 

CC Alzheimer's disease. Inhibitors of beta-secretase are administered to a 

CC mammalian subject e.g. with Alzheimer's disease or Alzheimer's disease- 

CC like pathology to test if they maintain or improve cognitive ability or 

CC reduce the plaque burden. The compounds are used for the treatment of 

CC amyloidogenic diseases e.g. Alzheimer's disease. The present sequence 

CC represents a peptide substrate used to test the activity of beta- 

CC secretase enzyme 
XX 

SQ Sequence 9 AA; 



Query Match 100.0%; Score 8; DB 3; Length 9; 

Best Local Similarity 100.0%; Pred. No. 1.4e+06; 

Matches 8; Conservative 0; Mismatches 0; Indels 0; Gaps 



Qy 1 EVNLDAEF 8 

I I I I I I I I 
Db 2 EVNLDAEF 9 



RESULT 9 
AAG73297 

ID AAG73297 standard; peptide; 9 AA. 
XX 

AC AAG73297; 
XX 

DT 14-AUG-2001 (first entry) 
XX 

DE Protease indicator compound peptide #2 6. 
XX 

KW Protease detection; peptide cleavage; enzyme activity; fluorogenic; 
KW viral infection; cancer metastasis; emphysema; arthritis; thrombosis; 
KW haemophilia. 
XX 

OS Synthetic. 
XX 

PN WO200118238 - Al . 
XX 

PD 15-MAR-2001. 
XX 



PF ll-SEP-2000; 2000WO-US024882 . 
XX 

PR 10-SEP-1999; 99US-00394019 . 
XX 

PA (ONCO-) ONCOIMMUNIN INC. 
XX 

PI Komoriya A, Packard BS ; 
XX 

DR WPI; 2001-389573/41. 
XX 

PT New fluorogenic compositions whose fluorescence level increases in the 

PT presence of active proteases, useful for detecting and localizing 

PT protease activity in biological samples, particularly in frozen tissue 

PT samples . 

XX 

PS Claim 1; Page 70; 8 6pp ; English. 
XX 

CC The present invention describes fluorogenic compositions which can be 

CC used for the detection of protease activity. This can be useful as an 

CC indicator of viral infection, cancer metastasis, haemophilia, emphysema 

CC thrombosis and arthritis . The fluorogenic compositions comprise a 

CC peptide, a peptide spacer and a donor and an acceptor fluorophore. The 

CC peptide is cleaved by a protease and the fluorophores can then be 

CC detected. The present sequence is one of the peptides described in the 

CC exemplification of the invention 

XX 

SQ Sequence 9 AA; 

Query Match 100.0%; Score 8; DB 4; Length 9; 

Best Local Similarity 100.0%; Pred. No. 1.4e+06; 

Matches 8; Conservative 0; Mismatches 0; Indels 0; Gaps 

Qy 1 EVNLDAEF 8 

I I f i I I 1 I 
Db 2 EVNLDAEF 9 



RESULT 10 
ABB06519 

ID ABB06519 standard; peptide; 9 AA. 
XX 

AC ABB06519; 
XX 

DT 31-MAY-2002 (first entry) 
XX 

DE Beta-secretase related peptide SEQ ID NO: 113. 
XX 

KW Beta-secretase; enzyme; cleavage site; amyloid protein precursor; APP; 
KW aspartyl protease; neuroprotective; nootropic; beta-secretase inhibitor 
KW Alzheimer's disease. 
XX 

OS Homo sapiens. 
OS Synthetic. 
XX 

PN WO200206306-A2 . 
XX 

PD 24- JAN-2002 . 



XX 

PF 19-JUL-2001; 2001WO-US023035. 
XX 

PR 19- JUL-2000; 2000US-02 197 95P . 

PR 12-MAR-2001; 2001US-0275251P . 
XX 

PA (PHAA ) PHARMACIA & UPJOHN CO. 
XX 

PI Yan R, Tomasselli AG, Gurney ME, Emmons TL, Bienkowski MJ; 

PI Heinrikson RL; 

XX 

DR WPI; 2002-216995/27. 
XX 

PT Novel substrates for human aspartyl protease useful for identifying 

PT modulators of beta secretase activity of aspartyl protease for treating 

PT Alzheimer's disease. 

XX 

PS Disclosure; Page 156; 188pp; English. 
XX 

CC The present invention describes an isolated peptide (I) comprising a 

CC sequence of at least four amino acids, where the peptide is a substrate 

CC for conducting aspartyl protease assays. (I) has neuroprotective and 

CC nootropic activities, and can be used as an inhibitor of beta-secretase 

CC activity. A beta-secretase modulator from the present invention can be 

CC used for inhibiting beta-secretase activity in vivo, and in the 

CC manufacture of a medicament for the treatment of Alzheimer's disease. 

CC Pharmaceutical compositions from the present invention can be used for 

CC treating a disease or condition characterised by an abnormal beta- 

CC secretase activity. (I) is useful for identifying agents that modulate 

CC the activity of human Asp2 aspartyl protease (Hu-Asp2) . (I) is useful as 

CC a core structure to construct derivatives. ABL49914 to ABL49925 and 

CC ABB06409 to ABB06593 represent sequences used in the exemplification of 

CC the present invention 

XX 

SQ Sequence 9 AA; 

Query Match 100.0%; Score 8; DB 5 ; Length 9; 

Best Local Similarity 100.0%; Pred. No. 1.4e+06; 

Matches 8; Conservative 0; Mismatches 0; Indels 0; Gaps 

Qy 1 EVNLDAEF 8 

II ! I I I I I 
Db 1 EVNLDAEF 8 



RESULT 11 
ABB09003 

ID ABB09003 standard; peptide; 9 AA. 
XX 

AC ABB09003; 
XX 

DT 19-JUN-2002 (first entry) 
XX 

DE Peptide #1 used as a tag to identify cleavage products. 
XX 

KW Amyloid precursor protein; APP; Alzheimer's disease; nootropic; 
KW neuroprotective; cerebroprotective; haemostatic; antiparkinsonian; 



KW Down's syndrome; cerebral amyloid angiopathy; dementia. 
XX 

OS Synthetic. 
XX 

FH Key Location/ Quali tiers 

FT Cleavage-site 5. .6 

FT /note= "Beta secretase cleavage site" 

XX 

PN WO200202505-A2 . 
XX 

PD 10-JAN-2002. 
XX 

PF 29-JUN-2001; 2 001WO-US02 0 8 52 . 
XX 

PR 30-JUN-2000; 2000US-0215323P . 
XX 

PA ( ELAN- ) ELAN PHARM INC. 
XX 

PI Fang LY, Horn R, John V, Maillaird M; 
XX 

DR WPI; 2002-171625/22. 
XX 

PT New substituted amines are e.g. beta-amyloid peptide inhibitors, useful 

PT for treating Alzheimer's disease, mild cognitive impairment and Down's 

PT syndrome. 
XX 

PS Example D; Page 96; 13 6pp; English. 
XX 

CC This invention relates to substituted amines and their salts, which are 

CC nootropic, neuroprotective, cerebroprotective , haemostatic and 

CC antiparkinsonian in their action. They are used in the preparation of a 

CC composition useful for treating, preventing or delaying the on-set of 

CC Alzheimer's disease, for treating mild cognitive impairment, Down's 

CC syndrome, cerebral amyloid angiopathy, dementia associated with 

CC Parkinson's disease, dementia associated with progressive supranuclear 

CC palsy, and dementia associated with cortical basal degeneration. They are 

CC also useful for treating diseases characterised by beta-amyloid deposits 

CC in brain, and for producing beta-secretase complexes. This sequence 

CC represents a peptide that encorporate the known cleavage site of beta- 

CC secretase. This substrate is used to assay beta-secretase activity 

XX 

SQ Sequence 9 AA; 

Query Match 100.0%; Score 8; DB 5; Length 9; 

Best Local Similarity 100.0%; Pred. No. 1.4e+06; 

Matches 8; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 

Qy 1 EVNLDAEF 8 

I I I I I I I I 
Db 2 EVNLDAEF 9 



RESULT 12 
ABU60429 

ID ABU60429 standard; peptide; 9 AA. 
XX 

AC ABU60429; 



XX 

DT 29-APR-2003 (first entry) 
XX 

DE Protease binding peptide motif SEQ ID 139. 
XX 

KW Protease; indicator; chromophore; H-dimer; fluorescence; absorbance; 

KW nuclease; screening; fluorophore; substrate cleavage. 

XX 

OS Synthetic . 
XX 

PN WO200261038-A2 . 
XX 

PD 08-AUG-2002 . 
XX 

PF 21-DEC-2001; 2 001WO-US04 97 8 1 . 
XX 

PR 22-DEC-2000; 2 0 OOUS-0 074 72 8 7 . 
XX 

PA (ONCO-) ONCOIMMUNIN INC. 
XX 

PI Packard BS, Komoriya A; 
XX 

DR WPI; 2002-698548/75. 
XX 

PT Indicator composition comprising polypeptide or nucleic acid backbone 

PT joining two same chromophores resulting in quenching of fluorescence 

PT of /change in absorbance of chromophores , useful for detecting protease 

PT activity. 
XX 

PS Disclosure; Page 34; 97pp; English. 
XX 

CC This invention describes a novel indicator composition (referred as homo- 

CC doubly labeled compositions) comprising a polypeptide backbone or a 

CC nucleic acid backbone joining two chromophores of the same species 

CC whereby the chromophores form an H-dimer resulting in quenching of the 

CC fluorescence of or a change in the absorbance of the chromophore, a 

CC decrease in fluorescence or a change in absorbance indicates that the 

CC first molecule and the second molecule are interacting. The indicator is 

CC useful for detecting the activity of a protease, where an increase in 

CC fluorescence or a change in absorbance indicates that the protease 

CC cleaves the polypeptide backbone. The indicator is attached to a solid 

CC support inside a mammalian, yeast or insect cell. The composition bears a 

CC hydrophobic group such as Fmoc, 9- f luoreneacetyl group, 1- 

CC f luorenecarboxylic group, 9-f lorenecarboxylic group, and 9-f luorenone-1- 

CC carboxylic group, benzyloxycarbonyl , Xanthyl (Xan) , Trityl (Trt) , 4- 

CC methyltrityl (Mtt), 4-methoxytrityl (Mmt), 4-methoxy-2 , 3 , 6-trimethyl- 

CC benzenesulphonyl (Mtr) , Mesitylene-2-sulphonyl (Mts), 4,4'- 

CC dimethoxybenzhydryl (Mbh) , etc. The method described in the invention is 

CC useful for detecting protease or nuclease activity (or the presence of 

CC nucleic acid) in histological section, cells in culture, (e.g., seeded or 

CC cultured adherent cells), a biological sample such as tissue, biopsy, 

CC lymph, embryo, or whole animal, or cell suspension derived from a 

CC biological sample such as tissue, blood, urine, saliva, lymph, or biopsy. 

CC The indicator composition is also useful for screening a test agent for 

CC the ability to modulate a protease (or a nuclease, lipase, etc.) . The 

CC indicator reagents allow rapid determination of protease activity in a 

CC matter of minutes in a single-step procedure. The fluorescent indicators 



CC both absorb and emit in the visible range (400-800 nm) . These signals are 

CC therefore not readily quenched by, nor is activation of the fluorophores, 

CC that is, absorption of light, interfered with by background molecules; 

CC therefore they are easily detected in biological samples. The fluorogenic 

CC protease indicators utilise high efficiency fluorophores and are able to 

CC achieve a high degree of quenching while providing a strong signal when 

CC the quench is released by cleavage of the peptide substrate. The high 

CC signal allows detection of very low levels of protease activity. Thus the 

CC fluorogenic protease indicators are particularly well suited for in situ 

CC detection of protease activity. ABU60357-ABU6047 7 represent peptides use 

CC to illustrate the method described in the disclosure of the invention 
XX 

SQ Sequence 9 AA; 

Query Match 100.0%; Score 8; DB 5; Length 9; 

Best Local Similarity 100.0%; Pred. No. 1.4e+06; 

Matches 8; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 

Qy 1 EVNLDAEF 8 

I I I i I I I I 
Db 2 EVNLDAEF 9 



RESULT 13 
ABU60441 

ID ABU60441 standard; peptide; 9 AA. 
XX 

AC ABU60441; 
XX 

DT 29-APR-2003 (first entry) 
XX 

DE Protease binding peptide motif SEQ ID 165. 
XX 

KW Protease; indicator; chromophore; H-dimer; fluorescence; absorbance; 

KW nuclease; screening; fluorophore; substrate cleavage. 

XX 

OS Synthetic. 
XX 

PN WO200261038-A2 . 
XX 

PD 08-AUG-2002. 
XX 

PF 21-DEC-2001; 2 0 0 1WO-US04 97 8 1 . 
XX 

PR 22-DEC-2000; 2 000US-O0747287 . 
XX 

PA (ONCO-) ONCOIMMUNIN INC. 
XX 

PI Packard BS, Komoriya A; 
XX 

DR WPI ; 2002-698548/75. 
XX 

PT Indicator composition comprising polypeptide or nucleic acid backbone 
PT joining two same chromophores resulting in quenching of fluorescence 
PT of /change in absorbance of chromophores, useful for detecting protease 
PT activity. 
XX 



PS Disclosure; Page 35; 97pp; English. 
XX 

CC This invention describes a novel indicator composition (referred as homo- 

CC doubly labeled compositions) comprising a polypeptide backbone or a 

CC nucleic acid backbone joining two chromophores of the same species 

CC whereby the chromophores form an H-dimer resulting in quenching of the 

CC fluorescence of or a change in the absorbance of the chromophore, a 

CC decrease in fluorescence or a change in absorbance indicates that the 

CC first molecule and the second molecule are interacting. The indicator is 

CC useful for detecting the activity of a protease, where an increase in 

CC fluorescence or a change in absorbance indicates that the protease 

CC cleaves the polypeptide backbone. The indicator is attached to a solid 

CC support inside a mammalian, yeast or insect cell. The composition bears a 

CC hydrophobic group such as Fmoc, 9- f luoreneacetyl group, 1- 

CC f luorenecarboxylic group, 9- f lorenecarboxylic group, and 9- f luorenone-1- 

CC carboxylic group, benzyloxycarbonyl , Xanthyl (Xan) , Trityl (Trt) , 4- 

CC methyltrityl (Mtt) , 4-methoxytrityl (Mmt), 4-methoxy-2 , 3 , 6-trimethyl- 

CC benzenesulphonyl (Mtr) , Mesitylene-2-sulphonyl (Mts), 4,4'- 

CC dimethoxybenzhydryl (Mbh) , etc. The method described in the invention is 

CC useful for detecting protease or nuclease activity (or the presence of 

CC nucleic acid) in histological section, cells in culture, (e.g., seeded or 

CC cultured adherent cells), a biological sample such as tissue, biopsy, 

CC lymph, embryo, or whole animal, or cell suspension derived from a 

CC biological sample such as tissue, blood, urine, saliva, lymph, or biopsy. 

CC The indicator composition is also useful for screening a test agent for 

CC the ability to modulate a protease (or a nuclease, lipase, etc.) . The 

CC indicator reagents allow rapid determination of protease activity in a 

CC matter of minutes in a single-step procedure. The fluorescent indicators 

CC both absorb and emit in the visible range (400-800 nm) . These signals are 

CC therefore not readily quenched by, nor is activation of the f luorophores , 

CC that is, absorption of light, interfered with by background molecules; 

CC therefore they are easily detected in biological samples . The fluorogenic 

CC protease indicators utilise high efficiency fluorophores and are able to 

CC achieve a high degree of quenching while providing a strong signal when 

CC the quench is released by cleavage of the peptide substrate. The high 

CC signal allows detection of very low levels of protease activity. Thus the 

CC fluorogenic protease indicators are particularly well suited for in situ 

CC detection of protease activity. ABU60357-ABU60477 represent peptides use 

CC to illustrate the method described in the disclosure of the invention 
XX 

SQ Sequence 9 AA; 

Query Match 100.0%; Score 8; DB 5; Length 9; 

Best Local Similarity 100.0%; Pred. No. 1.4e+06; 

Matches 8; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 

Qy 1 EVNLDAEF 8 

MINIM 
Db 2 EVNLDAEF 9 



RESULT 14 
AAE16663 

ID AAE16663 standard; peptide; 9 AA. 
XX 

AC AAE16663; 
XX 



DT 09-APR-2002 (first entry) 
XX 

DE Oligopeptide substrate #1, used in the assay of the invention. 
XX 

KW Disubstituted amine derivative; Alzheimer's disease; Down's syndrome; 

KW degenerative disease; beta-amyloid deposit; mild cognitive impairment; 

KW hereditary cerebral haemorrhage; amyloidosis; degenerative dementia; 

KW cerebral amyloid angiopathy; Parkinson's disease; supranuclear palsy; 

KW cortical basal degeneration; AD; MCI; amyloid precursor protein; 

KW beta-secretase ; amyloid beta peptide; A beta peptide. 
XX 

OS Unidentified. 
XX 

FH Key Location/Qualifiers 

FT Cleavage-site 5. .6 

XX 

PN WO200202518-A2 . 
XX 

PD 10- JAN-2002 . 
XX 

PF 29-JUN-2001; 2 001WO-US02 0 85 6 . 
XX 

PR 30-JUN-2000; 2 000US-02 1532 3P . 
XX 

PA (ELAN-) ELAN PHARM INC. 

PA (PHAA ) PHARMACIA & UPJOHN CO. 

XX 

PI Beck JP, Gailunas A, Horn R, Jagodzinska B, John V, Maillaird M; 
XX 

DR WPI; 2002-122550/16. 
XX 

PT New disubstituted amine derivatives, useful for treating Alzheimer's 

PT disease and other degenerative diseases . 

XX 

PS Example D; Page 163; 2 8 6pp; English. 
XX 

CC The invention relates to disubstituted amine derivative compounds useful 

CC for treating Alzheimer's disease and other degenerative diseases. 

CC Compounds of the invention possess beta secretase inhibitory activity and 

CC are useful for inhibiting beta-secretase mediated cleavage of amyloid 

CC precursor protein (APP) and particularly, to inhibit the production of 

CC amyloid beta (A beta) peptide. They are useful for treating or preventing 

CC a disease characterised by beta-amyloid deposits in the brain e.g. for 

CC treating or preventing Alzheimer's disease (AD), for helping to prevent 

CC or delay the onset of Alzheimer's disease, for treating mild cognitive 

CC impairment (MCI), Down's syndrome, hereditary cerebral haemorrhage with 

CC the amyloidosis of the Dutch type, cerebral amyloid angiopathy and 

CC preventing its potential consequences i.e. single and recurrent lobar 

CC haemorrhages, for treating other degenerative dementias including 

CC dementias of mixed vascular and degenerative origin, dementia associated 

CC with Parkinson's disease, dementia associated with progressive 

CC supranuclear palsy, dementia associated with cortical basal degeneration 

CC and diffuse Lewy body type of Alzheimer's disease. The present sequence 

CC is oligopeptide substrate used in assay of the invention 

XX 

SQ Sequence 9 AA; 



Query Match 100.0%; Score 8; DB 5 ; Length 9; 

Best Local Similarity 100.0%; Pred. No. 1.4e+06; 
Matches 8; Conservative 0; Mismatches 0; Indels 



0; Gaps 



0; 



Qy 1 EWLDAEF 8 

I I I I i I I E 

Db 2 EVNLDAEF 9 



RESULT 15 
AAU74837 

ID AAU74837 standard; peptide; 9 AA. 
XX 

AC AAU74837; 
XX 

DT 09-APR-2002 (first entry) 
XX 

DE Synthetic amyloid precursor protein (APP) oligopeptide substrate #1. 
XX 

KW Amyloid precursor protein; APP; Alzheimer's disease; amine XV; nootropic; 

KW neuroprotective; cerebroprotective ; haemostatic; antiparkinsonian; 

KW beta-secretase; amyloid beta peptide; amyloidosis; beta-amyloid plaque; 

KW mild cognitive impairment; MCI; Down's syndrome; 

KW hereditary cerebral haemorrhage; cortical basal degeneration; 

KW cerebral amyloid angiopathy; lobar haemorrhage; degenerative dementia; 

KW Parkinson's disease; supranuclear palsy; dementia. 

XX 

OS Synthetic. 
XX 

FH Key Location/Qualifiers 
FT Cleavagesite 5 . .6 

FT /note= "Beta-secretase cleavage site" 

XX 

PN WO200202506-A2 . 
XX 

PD 10-JAN-2002. 
XX 

PF 29-JUN-2001; 2 0 01WO-US02 093 0 . 
XX 

PR 30-JUN-2000; 2 000US-0215323P . 
XX 

PA (ELAN-) ELAN PHARM INC. 
XX 

PI Fang LY, John V; 
XX 

DR WPI; 2002-147995/19. 
XX 

PT New substituted amines are e.g. amyloid beta peptide production 

PT inhibitors, useful for treating or preventing e.g. Alzheimer's disease, 

PT mild cognitive impairment and other degenerative diseases . 

XX 

PS Example D; Page 118; 434pp; English. 
XX 

CC This invention relates to a novel substituted amine of formula XV and its 

CC salts which can be used to treat Alzheimer's disease and similar 

CC diseases . The compound of the invention may have nootropic, 

CC neuroprotective, cerebroprotective, haemostatic and antiparkinsonian 



CC activities. The amine acts as a beta-secretase activity inhibitor, 

CC inhibiting cleavage of amyloid precursor protein (APP) and the formation 

CC of amyloid beta peptide and beta-amyloid plaque production. The amine of 

CC the invention is useful for treating or preventing a disease 

CC characterised by beta-amyloid deposits in the brain e.g. Alzheimer's 

CC disease (AD), for treating mild cognitive impairment (MCI), preventing or 

CC delaying the onset of Alzheimer's disease in those who would progress 

CC from MCI to AD, for treating Down's syndrome, humans who have hereditary 

CC cerebral haemorrhage with the Amyloidosis of the Dutch type, cerebral 

CC amyloid angiopathy and preventing its potential consequences i.e. single 

CC and recurrent lobar haemorrhages, for treating other degenerative 

CC dementias, including dementias of mixed vascular and degenerative origin, 

CC dementia associated with Parkinson's disease, dementia associated with 

CC progressive supranuclear palsy, dementia associated with cortical basal 

CC degeneration and diffuse Lewy body type of Alzheimer's disease. The 

CC present sequence represents a synthetic APP oligopeptide substrate #1 for 

CC the beta secretase enzyme. This synthetic substrate was used to test the 

CC efficiency of the amine as an inhibitor of beta secretase enzyme activity 

XX 

SQ Sequence 9 AA; 

Query Match 100.0%; Score 8; DB 5; Length 9; 

Best Local Similarity 100.0%; Pred. No. 1.4e+06; 

Matches 8; Conservative 0; Mismatches 0; Indels 0; Gaps 0 

Qy 1 EVNLDAEF 8 

MINIM 

Db 2 EVNLDAEF 9 



RESULT 16 
ABB07598 

ID ABB07598 standard; peptide; 9 AA. 
XX 

AC ABB07598; 
XX 

DT 08-MAY-2002 (first entry) 
XX 

DE Synthetic oligopeptide substrate having beta-secretase cleavage site. 
XX 

KW APP; amine derivative; beta-secretase; amyloid precursor protein; A beta; 
KW amyloid beta peptide; beta-amyloid; nootropic; neuroprotective; 
KW cerebroprotective ; haemostatic; antiparkinsonian. 
XX 

OS Synthetic. 
XX 

FH Key Location/Qualifiers 
FT Cleavage-site 5 . .6 

FT /note= "beta-secretase cleavage site" 

XX 

PN WO200202520-A2 . 
XX 

PD 10-JAN-2002. 
XX 

PF 02-JUL-2001; 2 0 01WO-US02 1000 . 
XX 

PR 30-JUN-2000; 2 0 00US-02 15323P . 



PR 29- JUN-2001; 2001US- 008 95843 . 
XX 

PA (ELAN-) ELAN PHARM INC. 

PA (PHAA ) PHARMACIA & UPJOHN CO. 

XX 

PI Beck JP, Gailunas A, Horn R, Jagodzinska B, John V, Maillaird M; 
XX 

DR WPI; 2002-188416/24. 
XX 

PT New disubstituted amine derivatives, useful for treating Alzheimer's 

PT disease and other degenerative diseases. 

XX 

PS Example D; Page 163; 286pp; English. 
XX 

CC The invention relates to disubstituted amine derivatives of specified 

CC formulae or their salts. The amine derivatives can inhibit beta-secretase 

CC activity or inhibit the cleavage of amyloid precursor protein (APP) in a 

CC reaction mixture. They can also be used to inhibit production of amyloid 

CC beta peptide (A beta) in a cell or inhibit production of beta-amyloid 

CC plaque in an animal. The amine derivatives are useful for treating or 

CC preventing a disease characterized by beta— amyloid deposits in the brain 

CC e.g. for treating or preventing Alzheimer's disease (AD), for helping 

CC prevent or delay the onset of AD, for treating mild cognitive impairment 

CC (MCI), preventing or delaying the onset of AD in those who would progress 

CC from MCI to AD, for treating Down's syndrome, humans who have hereditary 

CC cerebral haemorrhage with the Amyloidosis of the Dutch type, cerebral 

CC amyloid angiopathy and preventing its potential consequences i.e. single 

CC and recurrent lobar haemorrhages, for treating other degenerative 

CC dementias, including dementias of mixed vascular and degenerative origin, 

CC dementia associated with Parkinson's disease, dementia associated with 

CC progressive supranuclear palsy, dementia associated with cortical basal 

CC degeneration, diffuse Lewy body type of Alzheimer's disease. Sequences 

CC ABB07598-599 are synthetic APP oligopeptide substrates containing the 

CC cleavage site of beta-secretase, used to assay beta-secretase activity 
XX 

SQ Sequence 9 AA; 

Query Match 100.0%; Score 8; DB 5 ; Length 9; 

Best Local Similarity 100.0%; Pred. No. 1.4e+06; 

Matches 8; Conservative 0; Mismatches 0; Indels 0; Gaps 0 

Qy 1 EVNLDAEF 8 

I II I ! I I I 
Db 2 EVNLDAEF 9 



RESULT 17 
AAM50897 

ID AAM50897 standard; peptide; 9 AA. 
XX 

AC AAM50897; 
XX 

DT 07-MAY-2002 (first entry) 
XX 

DE Oligopeptide substrate for beta-secretase. 
XX 

KW Beta-secretase; substrate; amyloid precursor protein; APP; 



KW amyloid beta peptide; inhibitor; screening; neurodegenerative disorder; 

KW cognitive impairment; Down's syndrome; Alzheimer's disease; 

KW hereditary cerebral haemorrhage; cerebral amyloid angiopathy; dementia; 

KW therapy. 

XX 

OS Synthetic . 
XX 

FH Key Location/Qualifiers 

FT Cleavage-site 5. .6 

XX 

PN WO200202512-A2 . 
XX 

PD 10- JAN-2002 . 
XX 

PF 29-JUN-2001; 2 00 1WO-US02 1012 . 
XX 

PR 30-JUN-2000; 2 0 0 0US- 02 15323P . 

PR 22-NOV-2000; 2 00 0US- 0252 73 6P . 

PR 15-DEC-2000; 2 000US-0255 95 6P . 

PR 13-FEB-2001; 2 001US-02 68 4 97 P . 

PR 29-MAR-2001; 2001US-027977 9P . 

PR 04-JUN-2001; 2 001US-02 9558 9P . 
XX 

PA (ELAN-) ELAN PHARM INC. 

PA ( PHAA ) PHARMACIA & UPJOHN CO. 

XX 

PI Maillaird M, Horn C, Gailunas A, Jagodzinska B, Fang LY, John V; 

PI Freskos JN, Pulley SR, Beck JP, Tenbrink RE; 

XX 

DR WPI; 2002-171627/22. 
XX 

PT New substituted amines are e.g. amyloid beta peptide production 

PT inhibitors, useful for treating or preventing e.g. Alzheimer's disease, 

PT mild cognitive impairment and other degenerative diseases . 

XX 

PS Example D; Page 344; 651pp; English. 
XX 

CC The present sequence is that of a peptide that incorporates the known 

CC cleavage site of beta-secretase on amyloid precursor protein (APP) . The 

CC peptide, which may optionally be tagged with a fluorescent or chromogenic 

CC moiety, can be used in assays to determine the beta-secretase inhibitory 

CC activity of novel substituted amine compounds of the invention. 

CC Comparison of substrate cleavage results in the presence of an inhibitor 

CC and in the presence of a control provides a measure of the compound's 

CC inhibitory activity. Novel compounds of the invention are effective 

CC inhibitors of beta-secretase, inhibit beta-secretase mediated cleavage of 

CC APP, are effective inhibitors of A beta production, and/or effectively 

CC reduce amyloid beta deposits or plaques . They are used in the treatment 

CC or prevention of diseases characterised by amyloid deposits in the brain, 

CC e.g. for treating or preventing Alzheimer's disease (AD), for helping 

CC prevent or delay the onset of AD, for treating mild cognitive impairment 

CC (MCI), preventing or delaying the onset of AD in those who would progress 

CC from MCI to AD, for treating Down's syndrome and hereditary cerebral 

CC haemorrhage with Amyloidosis of the Dutch type, for treating cerebral 

CC amyloid angiopathy and preventing its potential consequences i.e. single 

CC and recurrent lobar haemorrhages, for treating other degenerative 

CC dementias, including dementias of mixed vascular and degenerative origin, 



CC dementia associated with Parkinson's disease, dementia associated with 

CC progressive supranuclear palsy, and dementia associated with cortical 

CC basal degeneration, and for treating diffuse Lewy body type of AD (all 

CC claimed) 
XX 

SQ Sequence 9 AA; 

Query Match 100.0%; Score 8; DB 5; Length 9; 

Best Local Similarity 100.0%; Pred. No. 1.4e+06; 

Matches 8; Conservative 0; Mismatches 0; Indels 0; Gaps 0 

Qy 1 EVNLDAEF 8 

I I 1 I It II 
Db 2 EVNLDAEF 9 



RESULT 18 
ABP97975 

ID ABP97975 standard; peptide; 9 AA. 
XX 

AC ABP97975; 
XX 

DT 17-JUN-2003 (first entry) 
XX 

DE Synthetic substrate comprising beta-secretase cleavage site. 
XX 

KW Substrate; beta-secretase; alkylamide derivative; Alzheimer's disease; 

KW cognitive impairment; Down's syndrome; Hereditary Cerebral Haemorrhage; 

KW Parkinson's disease; cerebral amyloid angiopathy; degenerative dementia; 

KW supranuclear palsy; cortical basal degeneration. 
XX 

OS Synthetic. 
XX 

PN WO2003006423-A1 . 
XX 

PD 23-JAN-2003. 
XX 

PF ll-JUL-2002; 2 002WO-US022255 . 
XX 

PR ll-JUL-2001; 2 001US-0304525P . 

PR 30-JUL-2001; 2 001US-0308756P . 

PR 17-DEC-2001; 2 001US-034 1341P . 

PR 17-DEC-2001; 2 001US-034 1416P . 

PR 21-DEC-2001; 2 001US-034 4 872P . 

PR 21-DEC-2001; 2 001US-038 0574P . 
XX 

PA (ELAN-) ELAN PHARM INC. 

PA (PHAA ) PHARMACIA & UPJOHN CO. 

XX 

PI Gailunas A, Horn R, John V, Maillard M, Chrusciel RA, Fisher J; 

PI Jacobs J, Freskos JN, Brown DL, Fobian YM; 

XX 

DR WPI; 2003-239223/23. 
XX 

PT New N- ( 3-amino-2-hydroxypropyl ) alkylamide derivatives are beta secretase 

PT inhibitors used for treating e.g. Alzheimer's disease and Down's 

PT syndrome. 



XX 
PS 
XX 

cc 
cc 
cc 
cc 
cc 
cc 
cc 
cc 
cc 
cc 
cc 
cc 
cc 

XX 
SQ 



Example D; Page 139; 205pp; English. 

The present sequence represents a synthetic substrate comprising a beta- 
secretase cleavage site. The substrate was used to assay beta-secretase 
activity in the presence or absence of inhibitory compounds of the 
invention. The invention describes N— ( 3-Amino-2-hydroxypropyl ) alkylamide 
derivatives of a formula given in the specification. Compounds of the 
invention are used for treating or preventing Alzheimer's disease, mild 
cognitive impairment, Down 1 s syndrome, Hereditary Cerebral Haemorrhage 
with Amyloidosis of the Dutch-Type, cerebral amyloid angiopathy, other 
degenerative dementias and diffuse Lewy body type of Alzheimer's disease. 
They are also used for dementias of mixed vascular and degenerative 
origin, dementia associated with Parkinson's disease, dementia associated 
with progressive supranuclear palsy and dementia associated with cortical 
basal degeneration 

Sequence 9 AA; 



Query Match 100.0%; Score 8; DB 6; Length 9; 

Best Local Similarity 100.0%; Pred. No. 1.4e+06; 

Matches 8; Conservative 0; Mismatches 0; Indels 

Qy 1 EVNLDAEF 8 

MINIM 

Db 2 EVNLDAEF 9 



0; Gaps 



0; 



ABP57515; 

28-APR-2003 (first entry) 

Differentially isotopically labelled (DiMaS) peptide #8. 

Mass spectrometry; polymer; analysis; cleavage; substrate specificity; 
isotope; protease. 

Synthetic . 

WO2003001206-A1 . 

03-JAN-2003 . 

25-JUN-2 0 02 ; 2 0 02WO-GB0 02 921 . 
2 6-JUN-2 001; 2 0 01GB- 0001 5581 . 
(GLAX ) GLAXO GROUP LTD. 
Mckeown SC; 
WPI; 2003-184 066/18 . 



PT Analyzing cleavage of polymer, by providing polymer sample,- incubating 

PT the sample with labeled isotope for cleavage at potential cleavage site, 

PT and analyzing the masses of any uncleaved fragments by mass spectrometry. 
XX 

PS Example 3; Page 25; 73pp; English. 
XX 

CC The present invention describes a method (Ml) for analysing cleavage of a 

CC polymer. Ml comprises: (a) providing a sample of the polymer, a portion 

CC of the polymer molecules having been labeled at a position on one side of 

CC the potential cleavage site with a first isotopic label and a portion of 

CC the polymer molecules having been labeled at a position on the opposite 

CC side of the potential cleavage site with a second isotopic label; (b) 

CC incubating the sample under conditions suitable for cleavage at the 

CC potential cleavage site; and (c) analysing the mass(es) of any cleaved 

CC fragments by mass spectrometry and thereby determining whether and/or 

CC where cleavage has taken place. Ml is useful for analysing cleavage of a 

CC polymer, where the polymer is a linear polymer, and comprises a peptide 

CC or protein. Methods from the present invention can be used in discovering 

CC new or improved synthetic substrates for both known and unknown enzymes, 

CC e.g. enzymes identified from the human genome. The methods are also 

CC useful to identify the sequence origin, and in screening methods to 

CC identify new substrates for enzymes, in positional peptide scanning 

CC libraries, in in vivo/ex vivo/in vitro peptide, and in assaying methods 

CC for oligonucleotide or peptide sequencing and in measuring differential 

CC protein expression. The methods are useful for monitoring the cleavage of 

CC polypeptides or polynucleotides, and for determining optimal polymer 

CC substrates. ABP57505 to ABP57605 represent peptides used in the 

CC exemplification of the present invention 
XX 

SQ Sequence 9 AA; 

Query Match 100.0%; Score 8; DB 6; Length 9; 

Best Local Similarity 100.0%; Pred. No. 1.4e+06; 

Matches 8; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 



Qy 


1 EVNLDAEF 8 


Db 


1 1 1 ! 1 1 1 1 
2 EVNLDAEF 9 


RESULT 2 0 


ABP57084 


ID 


ABP57084 standard; peptide; 9 AA. 


XX 




AC 


ABP57084 ; 


XX 




DT 


14-APR-2003 (first entry) 


XX 




DE 


Synthetic oligopeptide substrate SEQ ID NO: 8. 


XX 




KW 


Beta-hydroxyamine derivative; nootropic; neuroprotective; MCI; 


KW 


Alzheimer's disease; mild cognitive impairment; Down's syndrome; 


KW 


hereditary cerebral haemorrhage; amyloidosis; lobar haemorrhage; 


KW 


cerebral amyloid angiopathy; degenerative dementia; dementia; 


KW 


Parkinson's disease; progressive supranuclear palsy; APP; 


KW 


cortical basal degeneration; amyloid precursor protein; substrate. 


XX 





OS Synthetic. 
XX 

FH Key Location/Qualifiers 

FT Modif ied-site 1 

FT /note= "biotinylated" 

XX 

PN WO2003002122-A1 . 
XX 

PD 09-JAN-2003. 
XX 

PF 27-JUN-2002; 2 0 02WO-US02 07 54 . 
XX 

PR 27-JUN-2001; 2 001US-0301210P . 

PR 18-SEP-2001; 2 001US-0323396P . 

PR 19-NOV-2001; 2001US-0332736P . 
XX 

PA (ELAN-) ELAN PHARM INC. 

PA (PHAA ) PHARMACIA & UPJOHN CO. 

XX 

PI John V, Horn R, Tucker J; 
XX 

DR WPI; 2003-201464/19. 
XX 

PT New beta-hydroxyamine derivatives are beta secretase inhibitors used for 

PT treating e.g. Alzheimer's disease, Down's syndrome and cerebral amyloid 

PT angiopathy. 
XX 

PS Example D; Page 56; 96pp; English. 
XX 

CC The present invention describes beta-hydroxyamine derivatives {I). Also 

CC described is a method for the preparation of (I) . (I) has nootropic and 

CC neuroprotective activities. (I) can be used as an inhibitor of: beta- 

CC secretase activity; beta amyloid peptide (A beta) production; beta 

CC deposition; and amyloid precursor protein (APP) cleavage. (I) can be used 

CC for preventing or delaying the onset of Alzheimer's disease, for treating 

CC mild cognitive impairment (MCI), Down's syndrome, hereditary cerebral 

CC haemorrhage with amyloidosis of the Dutch-type and cerebral amyloid 

CC angiopathy and preventing its potential consequences i.e. single and 

CC recurrent lobar haemorrhages, for treating other degenerative dementias 

CC (including dementias of mixed vascular and degenerative origin, dementia 

CC associated with Parkinson's disease, dementia associated with progressive 

CC supranuclear palsy, dementia associated with cortical basal degeneration) 

CC or diffuse Lewy body type of Alzheimer's disease. The present sequence 

CC represents a synthetic oligopeptide substrate, which is used in the 

CC exemplification of the present invention 
XX 

SQ Sequence 9 AA; 

Query Match 100.0%; Score 8; DB 6; Length 9; 

Best Local Similarity 100.0%; Pred. No. 1.4e+06; 

Matches 8; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 

Qy 1 EVNLDAEF 8 

I I II I I 1 I 
Db 2 EVNLDAEF 9 



RESULT 21 
ABP71468 

ID ABP71468 standard; peptide; 9 AA. 
XX 

AC ABP71468; 
XX 

DT 15-MAY-2003 (first entry) 
XX 

DE Beta-secretase cleavage site comprising oligopeptide. 
XX 

KW APP; alpha-hydroxyamide statine; nootropic; neuroprotective; haemostatic 

KW beta-secretase; beta Amyloid; A beta; amyloid precursor protein; 

KW degenerative dementia; Alzheimer's disease; Down's syndrome; 

KW Hereditary Cerebral Haemorrhage with Amyloidosis of the Dutch-Type; 

KW cerebral amyloid angiopathy. 

XX 

OS Synthetic . 
XX 

FH Key Location/Qualifiers 

FT Cleavage-site 5 . .6 

FT /note= "beta-secretase cleavage site" 

XX 

PN WO2 003006021-A1 . 
XX 

PD 23- JAN-2003 . 
XX 

PF 10-JUL-2002; 2002WO-US02 1742 . 
XX 

PR 10-JUL-2001; 2001US-0304 128P . 

PR 05-OCT-2001; 2001US-0327424P . 
XX 

PA (ELAN-) ELAN PHARM INC. 

PA (PHAA ) PHARMACIA & UPJOHN CO. 

PA (SCHO/) SCHOSTAREZ H J. 

PA (CHRU/) CHRUSCIEL R A. 

XX 

PI Schostarez HJ, Chrusciel RA; 
XX 

DR WPI; 2003-267942/26. 
XX 

PT New alpha-hydroxyamide statine derivatives useful for the treatment of 

PT e.g. Alzheimer's disease. 

XX 

PS Example D; Page 97; 142pp; English. 
XX 

CC The invention relates to alpha-hydroxyamide statine derivatives of 

CC specified formula. The compounds are capable of slowing beta amyloid 

CC peptide (A beta) formation and/or deposition in the brain and are 

CC effective inhibitors of beta-secretase. They inhibit beta-secretase- 

CC mediated cleavage of APP and are effective inhibitors of A beta 

CC production to reduce amyloid beta deposit or plaques . The compounds are 

CC useful in the manufacture of a medicament for treating or preventing e.g 

CC Alzheimer's disease, mild cognitive impairment, Down's syndrome, 

CC Hereditary Cerebral Haemorrhage with Amyloidosis of the Dutch-Type, 

CC cerebral amyloid angiopathy, other degenerative dementias (including 

CC dementias of mixed vascular and degenerative origin, dementia associated 

CC with Parkinson's disease, dementia associated with progressive 



CC supranuclear palsy, dementia associated with cortical basal degeneration) 

CC or diffuse Lewy body type of Alzheimer's disease in a patient. The 

CC present sequence represents an oligopeptide that incorporates the known 

CC beta-secretase cleavage site, used in assays for beta-secretase 

CC inhibition by the inhibitory compounds of the invention 

XX 

SQ Sequence 9 AA; 

Query Match 100.0%; Score 8; DB 6; Length 9; 

Best Local Similarity 100.0%; Pred. No. 1.4e+06; 

Matches 8; Conservative 0; Mismatches 0; Indels 0; Gaps 0 

Qy 1 EVNLDAEF 8 

I I I I I I I I 
Db 2 EVNLDAEF 9 



RESULT 22 
AA016449 

ID AAQ16449 standard; peptide; 9 AA. 
XX 

AC AA016449; 
XX 

DT 10-APR-2003 (first entry) 
XX 

DE Beta-secretase synthetic APP substrate peptide, SEQ ID No 8. 
XX 

KW Alzheimer's disease; bicyclo compound; amyloid beta peptide; 

KW beta-amyloid plaque; cognitive activity; amyloid precursor protein; APP; 

KW degenerative dementia; Parkinson's disease; cerebral amyloid angiopathy; 

KW beta-secretase; substrate. 

XX 

OS Synthetic. 
XX 

FH Key Location/Qualifiers 
FT Modified-site 1 

FT /note= "Labelled with biotin" 

FT Cleavage-site 5 . .6 

FT /note— "Cleaved between these two residues by beta- 

FT secretase" 

XX 

PN WO2003000261-A1 . 
XX 

PD 03-JAN-2003. 
XX 

PF 25-JUN-2002; 2 0 02WO-US02 0054 . 
XX 

PR 25-JUN-2001; 2 0 01US- 0300 67 IP . 
XX 

PA (ELAN-) ELAN PHARM INC. 

PA (PHAA ) PHARMACIA & UPJOHN CO. 

XX 

PI Beck JP; 
XX 

DR WPI; 2003-201375/19. 
XX 

PT Treatment or prevention of Alzheimer's disease involves use of bicyclo 



PT compounds . 
XX 

PS Example D; Page 60; 116pp; English. 
XX 

CC The invention comprises a method for the treatment or prevention of 

CC Alzheimer's disease. The method involves the administration of bicyclo 

CC compounds or their salts. A major factor in the development of 

CC Alzheimer's disease is the deposition of amyloid beta peptide (beta- 

CC amyloid plaques) in areas of the brain responsible for cognitive 

CC activities. Amyloid beta protein is derived by proteolysis (by 

CC secretases) of the amyloid precursor protein (APP) . The method of the 

CC invention is useful for treating and delaying the onset of Alzheimer's 

CC disease. The method is also useful for the treatment of other 

CC degenerative dementias (e.g. Parkinson's disease and cerebral amyloid 

CC angiopathy) . The method of the invention is useful for inhibiting beta- 

CC secretase activity - thereby inhibiting cleavage of an APP and inhibiting 

CC production of amyloid beta peptide. The present amino acid sequence 

CC represents a synthetic APP substrate for beta-secretase 

XX 

SQ Sequence 9 AA; 



Query Match 100.0%; Score 8; DB 6; Length 9; 

Best Local Similarity 100.0%; Pred. No. 1.4e+06; 

Matches 8; Conservative 0; Mismatches 0; Indels 0; Gaps 0 



Qy 1 EVNLDAEF 8 

I I I I I I I I 
Db 2 EVNLDAEF 9 



RESULT 23 
ABP71269 

ID ABP71269 standard; peptide; 9 AA. 
XX 

AC ABP712 69; 
XX 

DT 28-APR-2003 (first entry) 
XX 

DE Oligopeptide substrate used for beta-secretase assay. 
XX 

KW APP; aminediol; amyloid precursor protein; beta amyloid; A-beta; 
KW beta-secretase; nootropic; neuroprotective; antiparkinsonian; 
KW cerebroprotective ; haemostatic. 
XX 

OS Synthetic. 
XX 

FH Key Location/Qualifiers 
FT Modif ied-site 1 

FT /note= "N-terminal biotinylation' 1 

FT Cleavage-site 5. .6 

FT /note= "cleavage site for beta-secretase" 

XX 

PN WO2003006453-A1 . 
XX 

PD 23-JAN-2003. 
XX 

PF 10-JUL-2002; 2 0 02WO-US02 17 09 . 



XX 

PR 10-JUL-2001; 2 001US-0304129P . 
XX 

PA (ELAN-) ELAN PHARM INC. 

PA (PHAA ) PHARMACIA & UPJOHN CO. 

XX 

PI Schostarez HJ, Chrusciel RA; 
XX 

DR WPI; 2003-210414/20. 
XX 

PT New aminediols are beta-secretase inhibitors, useful for the treatment of 

PT e.g. Alzheimer's disease, cognitive impairment, Down's syndrome, and 

PT degenerative dementias. 
XX 

PS Example D; Page 145; 190pp; English. 
XX 

CC The invention relates to aminediol derivatives of specified formula. The 

CC compounds are capable of slowing beta amyloid (A-beta) peptide formation 

CC and/or deposition in the brain. They are effective inhibitors of beta- 

CC secretase and inhibit beta-secretase-mediated cleavage of amyloid 

CC precursor protein (APP) . Thus they effectively inhibit A-beta production 

CC and helps to reduce amyloid beta deposit or plaques. The compounds are 

CC useful in the manufacture of a medicament for treating or preventing 

CC Alzheimer's disease, mild cognitive impairment, Down's syndrome, 

CC Hereditary Cerebral Haemorrhage with Amyloidosis of the Dutch-Type, 

CC cerebral amyloid angiopathy, other degenerative dementias (including 

CC dementias of mixed vascular and degenerative origin, dementia associated 

CC with Parkinson's disease, dementia associated with progressive 

CC supranuclear palsy, dementia associated with cortical basal degeneration) 

CC or diffuse Lewy body type of Alzheimer's disease in a patient especially 

CC human. Sequences ABP71269-270 represent oligopeptide substrates 

CC containing known beta-secretase cleavage site are used to assay beta- 

CC secretase activity 

XX 

SQ Sequence 9 AA; 

Query Match 100.0%; Score 8; DB 6; Length 9; 

Best Local Similarity 100.0%; Pred. No. 1.4e+06; 

Matches 8; Conservative 0; Mismatches 0; Indels 0; Gaps 0 

Qy 1 EVNLDAEF 8 

I I I I i I I I 
Db 2 EVNLDAEF 9 



RESULT 2 4 
ABR44377 

ID ABR44377 standard; peptide; 9 AA. 
XX 

AC ABR44377; 
XX 

DT 14-JUL-2003 (first entry) 
XX 

DE Oligonpeptide #1 incorporating cleavage site of beta-secretase. 
XX 

KW Nootropic; neuroprotective; hemostatic; Alzheimer's disease; 

KW Down's syndrome; Hereditary Cerebral Hemorrhage; beta-secretase. 



XX 

OS Synthetic. 
XX 

PN WO2003027068-A2 . 
XX 

PD 03-APR-2003. 
XX 

PF 24-SEP-2002; 2 002WO-US0302 31 . 
XX 

PR 24-SEP-2001; 2 0 01US-032 44 07P . 
XX 

PA (ELAN-) ELAN PHARM INC. 
XX 

PI Gailunas A, Tucker JA, John V; 
XX 

DR WPI; 2003-421148/39. 
XX 

PT New substituted amines useful for the treatment of e.g. Alzheimer's 

PT disease, mild cognitive impairment and down's syndrome. 

XX 

PS Example D; Page 10 6; 14 0pp; English. 
XX 

CC This invention relates to substituted amines that are considered 

CC nootropic, neuroprotective and hemostatic in their action. They are 

CC useful in the manufacture of a medicament for treating or preventing 

CC Alzheimer's disease, mild cognitive impairment, Down's syndrome, 

CC Hereditary Cerebral Hemorrhage with Amyloidosis of the Dutch-Type and 

CC other degenerative dementias. The compounds are capable of slowing A-beta 

CC peptide formation and deposition in the brain and are effective 

CC inhibitors of beta-secretase that inhibit beta-secretase-mediated 

CC cleavage of APP that are effective inhibitors of A beta production and 

CC effective to reduce amyloid beta deposit or plaques. The present sequence 

CC represents a synthetic oliopeptide that incorporates the cleavage site of 

CC beta-secretase 

XX 

SQ Sequence 9 AA; 

Query Match 100.0%; Score 8; DB 6; Length 9; 

Best Local Similarity 100.0%; Pred. No. 1.4e+06; 

Matches 8; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 

Qy 1 EVNLDAEF 8 

I I I II I 1 I 
Db 2 EVNLDAEF 9 



RESULT 2 5 
ABP58375 

ID ABP58375 standard; peptide; 9 AA. 
XX 

AC ABP58375; 
XX 

DT 07-APR-2003 (first entry) 
XX 

DE Beta-secretase amyloid precursor protein substrate. 
XX 

KW Beta-secretase; substrate; amyloid precursor protein; inhibitor; 



KW hydroxyl alkyl amine; Alzheimer* s disease; nootropic; neuroprotective; 

KW haemostatic; antiparkinsonian. 

XX 

OS Synthetic. 
XX 

FH Key Location/Qualifiers 

FT Cleavage-site 5. .6 

XX 

PN WO200298849-A2 . 
XX 

PD 12-DEC-2002. 
XX 

PF 31-MAY-2002; 2 002WO-US017698 . 
XX 

PR 01-JUN-2001; 2001US-0295332P . 

PR 19-NOV-2 001; 2 001US-0332 639P . 

PR 28-DEC-2001; 2 0 0 1US- 0 34 37 7 2 P . 
XX 

PA (ELAN-) ELAN PHARM INC. 

PA (PHAA ) PHARMACIA & UPJOHN CO. 

XX 

PI Freskos J, Aquino J, Brown DL, Fang L, Fobian YM, Gailunas A; 

PI Guinn A, Varghese J, Romero AG, Tucker J, Tung J, Walker D; 
XX 

DR WPI; 2003-201287/19. 
XX 

PT New hydroxy alkyl amine compounds are beta-secretase activity inhibitors 

PT useful for treatment of e.g. Alzheimer's disease, mild cognitive 

PT impairment, Down's syndrome, hereditary cerebral hemorrhage. 
XX 

PS Example D; Page 197; 360pp; English. 
XX 

CC The present sequence is an example of a synthetic amyloid precursor 

CC protein (APP) substrate that includes a beta-secretase cleavage site. 

CC Cleavage products can be detected using HPLC, or fluorogenic or 

CC chromogenic detection methods. The invention provides novel hydroxy alkyl 

CC amine compounds that have beta-secretase inhibitor activity. Beta- 

CC secretase cleaves APP to amyloid beta peptide, found in the brains of 

CC Alzheimer's disease patients. The novel compounds halt or reduce A-beta 

CC production from APP and reduce or eliminate the formation of beta-amyloid 

CC deposits in the brain. They are useful for the manufacture of a 

CC medicament for the treatment or prevention of Alzheimer's disease, mild 

CC cognitive impairment, Down's syndrome, hereditary cerebral haemorrhage 

CC with amyloidosis of the Dutch type, cerebral amyloid angiopathy, other 

CC degenerative dementias, dementias of mixed vascular and degenerative 

CC origin, dementia associated with Parkinson's disease, dementia associated 

CC with progressive supranuclear palsy, dementia associated with cortical 

CC basal degeneration, diffuse Lewy body type of Alzheimer's disease (all 

CC claimed) and for treating f rontotemporal dementias with parkinsonism 

XX 

SQ Sequence 9 AA; 

Query Match 100.0%; Score 8; DB 6; Length 9; 

Best Local Similarity 100.0%; Pred. No. 1.4e+06; 

Matches 8; Conservative 0; Mismatches 0; Indels 0; Gaps 0 



Qy 



1 EVNLDAEF 8 



Db 2 EVNLDAEF 9 



RESULT 2 6 
AAO26801 

ID AAO26801 standard; peptide; 9 AA. 
XX 

AC AAO2 6801; 
XX 

DT 10-APR-2003 (first entry) 
XX 

DE Beta-secretase substrate peptide, SEQ ID No 8. 
XX 

KW Nootropic; neuroprotective; haemostatic; antiparkinsonian; aminoalcohol 

KW Alzheimer's disease; mild cognitive impairment; MCI; Down's syndrome; 

KW Hereditary Cerebral Haemorrhage; Amyloidosis; Dutch-Type; 

KW cerebral amyloid angiopathy; single; recurrent lobar haemorrhages; 

KW degenerative dementia; Parkinson's disease; cortical basal degeneration 

KW progressive supranuclear palsy; Lewy body type; beta-secretase; enzyme; 

KW substrate . 

XX 

OS Unidentified . 
XX 

PN WO2002100820-A1 . 
XX 

PD 19-DEC-2002 . 
XX 

PF ll-JUN-2002; 2 0 02WO-US 0190 67 . 
XX 

PR ll-JUN-2001; 2 0 01US- 02 97 42 OP . 
XX 

PA (ELAN-) ELAN PHARM INC. 

PA (PHAA ) PHARMACIA & UPJOHN CO. 

XX 

PI Maillard M, Tucker JA; 
XX 

DR WPI; 2003-167387/16. 
XX 

PT New substituted aminoalcohols useful for the treatment of e.g. 

PT Alzheimer 1 s disease, mild cognitive impairment, Down's syndrome or 

PT dementia associated with Parkinson's disease. 

XX 

PS Example D; Page 119; 206pp; English. 
XX 

CC The invention relates to novel substituted aminoalcohols, their esters 

CC and salts. The novel substituted aminoalcohols can be used for treating 

CC patient who has, or in preventing a patient from getting, a disease or 

CC condition (e.g. Alzheimer's disease), for helping prevent or delay the 

CC onset of the Alzheimer's disease, for treating patients with mild 

CC cognitive impairment (MCI) and preventing or delaying the onset of 

CC Alzheimer's disease in those who would progress from MCI to Alzheimer's 

CC disease, for treating Down's syndrome, for treating humans who have 

CC Hereditary Cerebral Haemorrhage with Amyloidosis of the Dutch-Type, for 

CC treating cerebral amyloid angiopathy and preventing its potential 

CC consequences i.e. single and recurrent lobar haemorrhages, for treating 

CC other degenerative dementias (including dementias of mixed vascular and 



CC degenerative origin, dementia associated with Parkinson's disease, 

CC dementia associated with progressive supranuclear palsy, dementia 

CC associated with cortical basal degeneration) or diffuse Lewy body type of 

CC Alzheimer's disease. This sequence represents a peptide substrate of beta 

CC -secretase used in an enzyme inhibition assay for analysing the 

CC inhibitory activity of the novel compounds of the invention 

XX 

SQ Sequence 9 AA; 

Query Match 100.0%; Score 8; DB 6; Length 9; 

Best Local Similarity 100.0%; Pred. No. 1.4e+06; 

Matches 8; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 

Qy 1 EVNLDAEF 8 

I I I I I I I I 
Db 2 EVNLDAEF 9 



RESULT 27 


ABG75940 


T T) 




XX 




AC 


ABG7 594 0 * 


XX 




DT 


0 7 -MAY— 2 0 0 3 (first ent rv ) 


XX 

A\ i * i 




DE 


Synthetic Amyloid precursor protein (APP) substrate #6. 


XX 




KW 


Amvloid Diecursor orotein ; APP ; substituted cvcl i c ami Hp : 


KW 


Alzheimer's disease; mild cognitive impairment; MCI; Down's syndrome; 


KW 


hereditary cerebral haemorrhage with Amyloidosis of the dutch type; 


KW 


cerebral amyloid angiopathy; degenerative dementia; dementia; 


KW 


diffuse Lewy body type of Alzheimer's disease; Parkinson's disease; 


KW 


progressive supernuclear palsy; cortical basal degeneration; 


KW 


beta-secretase inhibitor. 


XX 




OS 


Synthetic . 


XX 




PN 


WO2002100856-A1 . 


XX 




PD 


19-DEC-2002 . 


XX 




PF 


12-JUN-2 0 02 ; 2 002WO-US0 1907 6 . 


XX 




PR 


12-JUN-2 0 01; 2 001US-02 9754 6P . 


PR 


19-NOV-2 0 01; 2 001US-033308 3P . 


XX 




PA 


(ELAN-) ELAN PHARM INC. 


PA 


( PHAA ) PHARMACIA & UPJOHN CO. 


XX 




PI 


Pulley SR, Beck JP, Tenbrink RE; 


XX 




DR 


WPI; 2 0 03-2 7 822 3/2 7 . 


XX 




PT 


New substituted cyclic amides useful as beta-secretase and amyloid 


PT 


precursor protein inhibitors for treating e.g. Alzheimer's disease and 


PT 


Down ' s syndrome . 



XX 

PS Example D; Page 113; 164pp; English. 
XX 

CC The invention describes substituted cyclic amides (I) . (I) are useful in 

CC the manufacture of a medicament for treating, preventing and delaying 

CC Alzheimer's disease, for treating mild cognitive impairment (MCI), Down's 

CC syndrome, hereditary cerebral haemorrhage with Amyloidosis of the dutch 

CC type, cerebral amyloid angiopathy, other degenerative dementias and 

CC diffuse Lewy body type of Alzheimer's disease. (I) is also useful for 

CC treating dementia associated with Parkinson's disease, progressive 

CC supernuclear palsy and cortical basal degeneration. The compounds are 

CC capable of slowing A-beta peptide formation and/or deposition in the 

CC brain and are effective inhibitors of beta-secretase, that inhibit beta- 

CC secretase-mediated cleavage of APP, that are effective inhibitors of A 

CC beta production and/ or effective to reduce amyloid beta deposit or 

CC plaques . This is the amino acid sequence of an amyloid precursor protein 

CC (APP) substrate that can be cleaved by beta-secretase and is therefore 

CC used to assay beta-secretase activity in the presence and absence of the 

CC inhibitory compounds of the invention 

XX 

SQ Sequence 9 AA; 

Query Match 100.0%; Score 8; DB 6; Length 9; 

Best Local Similarity 100.0%; Pred. No. 1.4e+06; 

Matches 8; Conservative 0; Mismatches 0; Indels 0; Gaps 0 

Qy 1 EVNLDAEF 8 

I I I I I I I I 
Db 2 EVNLDAEF 9 



RESULT 2 8 
ABP71630 

ID ABP71630 standard; peptide; 9 AA. 
XX 

AC ABP71630; 
XX 

DT 29-MAY-2003 (first entry) 
XX 

DE Beta-secretase activity assaying oligopeptide substrate. 
XX 

KW APP; diaminediol; A-beta; beta-secretase; amyloid precursor protein; 

KW beta amyloid; nootropic; neuroprotective; antiparkinsonian; haemostatic. 

XX 

OS Synthetic. 
XX 

FH Key Location/Quali f iers 

FT Modif ied-site 1 

FT /note= "N-terminal biotin" 

FT Cleavage-site 5. .6 

FT /note= "beta-secretase cleavage site" 

XX 

PN WO2 003 006013-A1 . 
XX 

PD 23- JAN-2003 . 
XX 

PF 10-JUL-2002; 2002WO-US022459 . 



XX 

PR 10-JUL-2001; 2 0 0 1US-0304305P . 

PR 30-NOV-2001; 2 0 01US-0334 4 8 OP . 
XX 

PA (ELAN-) ELAN PHARM INC. 

PA (PHAA ) PHARMACIA & UPJOHN CO. 

XX 

PI Schostarez HJ, Chrusciel RA; 
XX 

DR WPI; 2003-289727/28. 
XX 

PT New diaminecliols useful for the treatment of e.g. Alzheimer's disease. 
XX 

PS Example D; Page 139; 182pp; English. 
XX 

CC The invention relates to diaminediol derivatives of specified formula. 

CC The compounds are capable of slowing A-beta peptide formation and/or 

CC deposition in the brain and are effective inhibitors of beta-secretase. 

CC They can inhibit beta-secretase-mediated cleavage of APP (amyloid 

CC precursor protein) and are effective inhibitors of A beta production and/ 

CC or effective to reduce amyloid beta deposit or plaques . The compounds are 

CC useful in the manufacture of a medicament for treating or preventing e.g. 

CC Alzheimer's disease, mild cognitive impairment, Down's syndrome, 

CC Hereditary Cerebral Hemorrhage with Amyloidosis of the Dutch-Type, 

CC cerebral amyloid angiopathy, other degenerative dementias (including 

CC dementias of mixed vascular and degenerative origin, dementia associated 

CC with Parkinson's disease, dementia associated with progressive 

CC supranuclear palsy, dementia associated with cortical basal degeneration) 

CC or diffuse Lewy body type of Alzheimer's disease in a patient e.g. human. 

CC Sequences ABP71630-631 represent synthetic APP oligopeptide substrates 

CC containing the known beta-secretase cleavage site and are used to assay 

CC beta-secretase activity 

XX 

SQ Sequence 9 AA; 

Query Match 100.0%; Score 8; DB 6; Length 9; 

Best Local Similarity 100.0%; Pred. No. 1.4e+06; 

Matches 8; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 

Qy 1 EVNLDAEF 8 

j I I M I I I 
Db 2 EVNLDAEF 9 



RESULT 2 9 
AAE36000 

ID AAE36000 standard; peptide; 9 AA. 
XX 

AC AAE36000; 
XX 

DT 26-JUN-2003 (first entry) 
XX 

DE APP substrate #6 used in the exemplification of the invention. 
XX 

KW Aminediol compound; beta secretase inhibitor; Alzheimer's disease; AD; 

KW Hereditary Cerebral Haemorrhage with Amyloidosis of Dutch-Type; HCHWA-D; 

KW cognitive impairment; cerebral amyloid angiopathy; Down's syndrome; APP; 



KW 
KW 
XX 
OS 
XX 
FH 
FT 
XX 
PN 
XX 
PD 
XX 
PF 
XX 
PR 
PR 
XX 
PA 
PA 
XX 
PI 
XX 
DR 
XX 
PT 
PT 
PT 
XX 
PS 
XX 
CC 

cc 

CC 

cc 
cc 
cc 
cc 
cc 
cc 
cc 

XX 
SQ 



amyloidosis; cortical basal degeneration; Parkinson's disease; therapy; 
dementia; supranuclear palsy; amyloid precursor protein. 

Unidentified . 

Key Location/Qualifiers 
Cleavage-site 5 . .6 

WO2002100818-A2. 
19-DEC-2002 . 

13-JUN-2 002 ; 2 002WO-US018 84 5 . 

13-JUN-2 001; 2 0 01US-02 97 82 7 P . 
19-NOV-2 001; 2 001US-0333 084P . 

(ELAN-) ELAN PHARM INC. 

(PHAA ) PHARMACIA & UPJOHN CO. 

Schos tarez H J, Chrus ciel RA; 

WPI; 2003-201321/19 . 

New aminediol compounds are beta secretase inhibitors used for treating 
e.g. Alzheimer's disease and dementia associated with Alzheimer's 
disease . 

Example C; Page 160; 213pp; English. 

The invention relates to aminediol compounds including beta secretase 
inhibitors , used for treating and preventing Alzheimer's disease (AD) , 
mild cognitive impairment , Down's syndrome, Hereditary Cerebral 
Haemorrhage with Amyloidosis of Dutch-Type (HCHWA-D) , cerebral amyloid 
angiopathy, other degenerative dementias, dementias of mixed vascular and 
degenerative origin, dementia associated with Parkinson's disease, 
dementia associated with progressive supranuclear palsy, dementia 
associated with cortical basal degeneration and diffuse Lewy body type of 
Alzheimer's disease. The present sequence is APP (amyloid precursor 
protein) substrate used in the exemplification of the invention 

Sequence 9 AA; 



Query Match 100.0%; Score 8; DB 6; Length 9; 

Best Local Similarity 100.0%; Pied. No. 1.4e+06; 
Matches 8; Conservative 0; Mismatches 0; Indels 



0; Gaps 



0; 



Qy 

Db 



1 EVNLDAEF 8 



2 EVNLDAEF 9 



RESULT 30 
ABR82372 

ID ABR82372 standard; peptide; 9 AA. 
XX 

AC ABR82372; 



XX 

DT 06-NOV-2003 (first entry) 
XX 

DE Beta-secretase cleavage site containing oligopeptide substrate. 
XX 

KW APP; amino carboxamide ; nootropic; neuroprotective; haemostatic; 
KW antiparkinsonian; beta-secretase; amyloid precursor protein. 
XX 

OS Synthetic. 
XX 

FH Key Location/ Qualifiers 

FT Modif ied-site 1 

FT /note= "N-terminal biotinylation" 

FT Cleavage-site 5. .6 

FT /note= "beta-secretase cleavage site" 

XX 

PN WO2 0 03 05 7 72 1-A2 . 
XX 

PD 17- JUL-2003 . 
XX 

PF 06-JAN-2003; 2 0 03WO-US00032 6 . 
XX 

PR 04-JAN-2002; 2 002US-0345316P . 
PR 18-JAN-2002; 2 002US-03504 19P . 
XX 

PA (ELAN-) ELAN PHARM INC. 

PA (PHAA ) PHARMACIA & UPJOHN CO. 

XX 

PI Jagodzinska B, Warpehoski MA; 
XX 

DR WPI; 2003-645986/61. 
XX 

PT New substituted amino carboxamide compounds are beta secretase inhibitors 
PT used for treating e.g. Alzheimer's disease and mild cognitive impairment 
PT and Down's syndrome. 
XX 

PS Example D; Page 101; 133pp; English. 
XX 

CC The invention relates to substituted amino carboxamide compounds of 

CC specified formula. The compounds can inhibit the activity beta-secretase 

CC and can also inhibit amyloid precursor protein (APP) cleavage. They can 

CC be used for treatment or prevention of Alzheimer's disease, mild 

CC cognitive impairment, Down ' s syndrome, hereditary cerebral haemorrhage 

CC with amyloidosis of the Dutch-type, cerebral amyloid angiopathy, other 

CC degenerative dementias, dementias of mixed vascular and degenerative 

CC origin, dementia associated with Parkinson's disease, dementia associated 

CC with progressive supranuclear palsy, dementia associated with cortical 

CC basal degeneration, or diffuse Lewy body type of Alzheimer's disease. The 

CC present sequence represents a synthetic oligopeptide substrate containing 

CC the known cleavage site of beta-secretase cleavage site 

XX 

SQ Sequence 9 AA; 



Query Match 100.0%; Score 8; DB 6; Length 9; 

Best Local Similarity 100.0%; Pred. No. 1.4e+06; 
Matches 8; Conservative 0; Mismatches 0; Indels 



0; 



Gaps 



0; 



Qy 1 EVNLDAEF 8 

I I I I I I I I 
Db 2 EVNLDAEF 9 



RESULT 31 
ABR56258 

ID ABR56258 standard; peptide; 9 AA. 
XX 

AC ABR562 58; 
XX 

DT 20-NOV-2003 (first entry) 
XX 

DE Amyloid Precursor Protein substrate #6. 
XX 

KW APP; nootropic; neuroprotective; cerebroprotective ; Alzheimer's disease; 

KW beta-secretase ; Amyloid Precursor Protein. 

XX 

OS Synthetic . 
XX 

FH Key Location/Qualifiers 
FT Modif ied-site 1 

FT /note= "Biotin-Ser" 

FT Cleavage-site 5 . .6 

XX 

PN WO2003043618-A2 . 
XX 

PD 30-MAY-2003. 
XX 

PF 19-NOV-2002; 2002WO-US037180 . 
XX 

PR 19-NOV-2001; 2 00 1US- 0332 8 63P . 
XX 

PA (ELAN-) ELAN PHARM INC. 

PA (PHAA ) PHARMACIA & UPJOHN. 

XX 

PI Schostarez HJ, Hanson GJ; 
XX 

DR WPI; 2003-532662/50. 
XX 

PT Treatment of Alzheimer's disease involves the use of amine diols or their 

PT salts. 

XX 

PS Example D; Page 227; 270pp; English. 
XX 

CC The present invention relates to novel amine diols and their salts, which 
CC can be used for the treatment of Alzheimer's disease. The compounds are 
CC inhibitors of beta-secretase, which cleaves Amyloid Precursor Protein 
CC (APP) to produce A beta peptide, a major component of the amyloid plaques 
CC found in the brains of Alzheimer's sufferers. APP substrates (ABR56251- 
CC ABR56259) that can be cleaved by beta-secretase, were used to assay beta- 
CC secretase activity in the presence or absence of the compounds of the 
CC invention 
XX 

SQ Sequence 9 AA; 



Query Match 



100.0%; Score 8; DB 6; Length 9; 



Best Local Similarity 100.0%; Pred. No. 1.4e+06; 

Matches 8; Conservative 0; Mismatches 0; Indels 0; Gaps 0 



Qy 



1 EVNLDAEF 8 



Db 



2 EVNLDAEF 9 



RESULT 32 
ABR62018 

ID ABR62018 standard; peptide; 9 AA. 
XX 

AC ABR62 018; 
XX 

DT 03-OCT-2003 (first entry) 
XX 

DE Beta-secretase cleavage site containing oligopeptide substrate. 
XX 

KW APP; Alzheimer ' s disease; nootropic; neuroprotective; cerebroprotective; 
KW beta amyloid peptide; amyloid precursor protein; beta-secretase; A beta; 
KW haemostatic . 
XX 

OS Synthetic. 
XX 

FH Key Location/Qualifiers 
FT Modif ied-site 1 

FT /note= "N-terminal biotinylation" 

FT Cleavage-site 5. .6 

XX 

PN W02 003047 57 6-A1 . 
XX 

PD 12- JUN-2003 . 
XX 

PF 03-DEC-2002; 2002WO-US040038 . 
XX 

PR 04-DEC-2001; 2 001US-033 6566P . 
XX 

PA (ELAN-) ELAN PHARM INC. 
XX 

PI Varghese J; 
XX 

DR WPI; 2003-558913/52. 
XX 

PT Treating Alzheimer ' s disease, mild cognitive impairment and other similar 
PT diseases comprises administering beta secretase inhibiting heterocyclic 
PT compounds . 



XX 
PS 
XX 
CC 
CC 
CC 
CC 
CC 
CC 



The invention relates to treating or preventing Alzheimer's disease and 
related diseases and involves administering heterocyclic compounds of 
specified formula. The compounds can inhibit the activity of beta- 
secretase. It can also inhibit the production and/or deposition of beta 
amyloid peptide (A beta) in the brain and inhibit beta-secretase mediated 
cleavage of amyloid precursor protein (APP) . The compounds can be used 
for treating, preventing or delaying the onset of Alzheimer's disease, 
for treating mild cognitive impairment (MCI) and preventing or delaying 



Example D; Page 65; 149pp; English. 



CC 
CC 



CC the onset of Alzheimer's disease in those with MCI, for treating Down's 

CC syndrome, hereditary cerebral haemorrhage with amyloidosis of the Dutch 

CC type, cerebral amyloid angiopathy and preventing its potential 

CC consequences i.e. single and recurrent lobar haemorrhages, other 

CC degenerative dementias (including dementias of mixed vascular and 

CC degenerative origin, dementia associated with Parkinson's disease, 

CC dementia associated with progressive supranuclear palsy, dementia 

CC associated with cortical basal degeneration) or diffuse Lewy body type 

CC Alzheimer's disease. The present sequence represents a synthetic 

CC oligopeptide substrate containing the known cleavage site of beta- 

CC secretase, used to assay beta-secretase activity 

XX 

SQ Sequence 9 AA; 

Query Match 100.0%; Score 8; DB 6; Length 9; 

Best Local Similarity 100.0%; Pred. No. 1.4e+06; 

Matches 8; Conservative 0; Mismatches 0; Indels 0; Gaps 

Qy 1 EVNLDAEF 8 

I I I I I I I I 
Db 2 EVNLDAEF 9 



RESULT 33 
ABR61887 

ID ABR61887 standard; peptide; 9 AA. 
XX 

AC ABR61887; 
XX 

DT 12-SEP-2003 (first entry) 
XX 

DE Beta-secretase cleavage site containing oligopeptide substrate. 
XX 

KW APP; Alzheimer's disease; beta-secretase; beta amyloid peptide; A beta; 

KW amyloid precursor protein; nootropic; neuroprotective. 

XX 

OS Synthetic. 
XX 

FH Key Location/Qualifiers 
FT Modif ied-site 1 

FT /note- "N-terminal biotin" 

FT Cleavage-site 5 . .6 

FT /note= "beta-secretase cleavage site" 

XX 

PN WO2003045378-A1 . 
XX 

PD 05-JUN-2003. 
XX 

PF 21-NOV-2002; 2 0 02WO-US037360 . 
XX 

PR 21-NOV-2001; 2 0 01US- 0334 692 P . 
XX 

PA (ELAN-) ELAN PHARM INC. 

PA ( PHAA ) PHARMACIA & UPJOHN CO. 

XX 

PI John V; 
XX 



DR WPI; 2003-577200/54. 
XX 

PT Use of amino acid derivatives for e.g. treating mild cognitive impairment 

PT and treating, preventing or delaying the onset of the Alzheimer's 

PT disease. 
XX 

PS Example D; Page 53; 204pp; English. 
XX 

CC The invention relates to the treatment of Alzheimer's disease (AD) and 

CC involves administration of amino acid derivatives of specified formula. 

CC The derivatives act as inhibitors of beta-secretase and inhibits the 

CC production and deposition of beta Amyloid peptide (A beta) . They also 

CC inhibit the cleavage of amyloid precursor protein (APP) . The derivatives 

CC can be used for treating, preventing or delaying the onset of AD, for 

CC treating mild cognitive impairment (MCI) and preventing or delaying the 

CC onset of AD in those who would progress from MCI to AD; treating Down's 

CC syndrome, hereditary cerebral haemorrhage with amyloidosis of the Dutch- 

CC type, cerebral amyloid angiopathy and preventing its potential 

CC consequences i.e. single and recurrent lobar haemorrhages, other 

CC degenerative dementias (including dementias of mixed vascular and 

CC degenerative origin, dementia associated with Parkinson's disease, 

CC dementia associated with progressive supranuclear palsy, dementia 

CC associated with cortical basal degeneration) or diffuse Lewy body type of 

CC Alzheimer's disease. The present sequence represents a synthetic 

CC oligopeptide substrate containing the known beta-secretase cleavage site, 

CC used to assay beta-secretase activity 

XX 

SQ Sequence 9 AA; 

Query Match 100.0%; Score 8; DB 6; Length 9; 

Best Local Similarity 100.0%; Pred. No. 1.4e+06; 

Matches 8; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 

Qy 1 EVNLDAEF 8 

I i I I I I M 
Db 2 EVNLDAEF 9 



RESULT 34 


ABR56195 


ID 


ABR56195 standard; peptide; 9 AA. 


XX 




AC 


ABR56195; 


XX 




DT 


18-DEC-2003 (first entry) 


XX 




DE 


Amyloid precursor protein, APP, substrate #7 . 


XX 




KW 


APP; nootropic; neuroprotective; cerebroprotective ; haemostatic; 


KW 


Alzheimer's disease; beta-secretase; Amyloid Precursor Protein; APP; 


KW 


degenerative dementia. 


XX 




OS 


Synthetic . 


XX 




FH 


Key Location/Qualifiers 
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Modif ied-site 1 


FT 


/note= "N-terminal Biotin" 
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cc 


sufferers . The method is also useful for treating mild cognitive 


cc 


impairment, Down's disease, hereditary cerebral haemorrhage with 


cc 


amyloidosis of the Dutch-type, cerebral amyloid angiopathy and other 


cc 


degenerative dementias. The present sequence is a synthetic APP substrate 


cc 


which can be cleaved by beta-secretase. This peptide was used to assay 


cc 


beta-secretase activity in the presence or absence of the inhibitory 


cc 


compounds of the invention 


XX 




SQ 


Sequence 9 AA; 


Query Match 100.0%; Score 8; DB 7; Length 9; 


Best Local Similarity 100.0%; Pred. No. 1.4e+06; 


Matches 8; Conservative 0; Mismatches 0; Indels 0; Gaps 0 


Qy 


1 EVNLDAEF 8 


Db 


1 1 II It 1 

2 EVNLDAEF 9 


RESULT 35 


ADC29723 


ID 


ADC29723 standard; peptide; 9 AA. 


XX 




AC 


ADC29723; 


XX 




DT 


18-DEC-2003 (first entry) 


XX 




DE 


Synthetic beta-secretase oligopeptide substrate #1. 


XX 





GenCore version 5.1*6 
Copyright (c) 1993 - 2004 Compugen Ltd. 



OM protein - protein search, using sw model 



Run on : 



March 26, 2004, 15:35:53 



Search time 35.5 Seconds 

(without alignments) 

58.966 Million cell updates/sec 



Title : 



Sequence : 



US-09-668-314C-71 



Perfect score: 8 



1 EVNLDAEF 8 



Scoring table: OLIGO 

Gapop 60.0 , Gapext 60.0 



Searched : 



1065169 seqs, 261661801 residues 



Word size : 



0 



Total number of hits satisfying chosen parameters 



1065169 



Minimum DB seq length: 0 

Maximum DB seq length: 2000000000 

Post-processing: Listing first 1000 summaries 



Database 



Publ 



1 
2 
3 
4 
5 
6 
7 
8 
9 
10 
11 
12 
13 
14 
15 
16 
17 
18 



/ 
/ 
/ 
/ 
/ 
/ 
/ 
/ 
/ 



ished_Applications_AA: * 

cgn2_6/ptodata/2/pubpaa/US07_PUBCOMB.pep : * 
cgn2_6/ptodata/2/pubpaa/PCT_NEW_PUB.pep: * 
cgn2__6/ptodata/2/pubpaa/US06_NEW_PUB.pep: * 
cgn2_6/ptodata/2/pubpaa/US06_PUBCOMB.pep: * 
cgn2_6/ptodata/2/pubpaa/US07_NEW_PUB.pep: * 
cgn2_6/ptodata/2/pubpaa/PCTUS_PUBCOMB.pep : * 
cgn2_6/ptodata/2/pubpaa/US08_NEW_PUB . pep : * 
cgn2_6/pt odat a/ 2 /pubpaa/US 0 8_PUBCOMB . pep : * 
cgn2_6/ptodata/2/pubpaa/US09A_PUBCOMB.pep : * 
/cgn2_6/ptodata/2/pubpaa/US09B_PUBCOMB.pep: * 
/cgn2_6/ptodata/2/pubpaa/US09C_PUBCOMB.pep : * 
/cgn2_6/ptodata/2/pubpaa/US09_NEW_PUB. pep : * 
/cgn2_6/ptodata/2/pubpaa/USlOA_PUBCOMB.pep : * 
/cgn2_6/ptodata/2/pubpaa/USl0B_PUBCOMB.pep : * 
/cgn2_6/ptodata/2/pubpaa/USlOC_PUBCOMB.pep : * 
/cgn2_6/ptodata/2/pubpaa/USlO_NEW_PUB.pep: * 
/cgn2_6/ptodata/2/pubpaa/US60_NEW_PUB.pep: * 
/cgn2 6/ptodata/2/pubpaa/US6 0_PUBCOMB . pep : + 



Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 
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4 


8 


100 . 


o 


9 


10 


US-09-895-871-8 


Sequence 8, Appli 


5 


8 


100 . 


o 


9 


12 


US-10-299-739-8 


Sequence 8, Appli 


6 


8 


100 . 


o 


9 


14 


US-10-192-625-8 


Sequence 8, Appli 


7 


8 


100 . 


o 


9 


14 


US-10-192-424-8 


Sequence 8, Appli 




8 


100 . 

_i_ \J \J a 


o 


9 


14 


US-10-183-126A-8 


Sequence 8, Appli 


9 


8 


100 . 


o 


9 


14 


US-10-171-343-8 


Sequence 8, Appli 


10 


R 


100 

-x» v \y • 


o 


9 


14 


US-10-264-707-8 


Sequence 8, Appli 




fi 


100 


o 


9 


14 


US-10-066-319-4 


Sequence 4, Appli 


1 9 


8 


100 . 


o 

vy 


9 


14 


US-10-337-075-8 


Sequence 8, Appli 
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R 


100 


o 


10 

_l. vy 


9 


US-0 9- 794-925- 63 


Sequence 63, Appl 
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5 


382 


12 


US-10-2 82-122A-7 085 6 


Sequence 


70856, A 


714 


5 


62 . 


5 


388 


12 


US- 10- 42 5-114-53241 


Sequence 


53241, A 


715 


5 


62 . 


5 


390 


9 


US-0 9-8 15-242-577 0 


Sequence 577 0, Ap 


716 


5 


62 . 


5 


390 


12 


US-10-2 82-122A-4 4 0 08 


Sequence 


44008, A 


717 


5 


62 . 


5 


390 


12 


US- 10-424-599-24 6963 


Sequence 


246963, 


718 


5 


62 . 


5 


391 


16 


US-10-344-738-91 


Sequence 


91, Appl 


719 


5 


62 . 


5 


392 


9 


US- 0 9- 8 15-2 4 2- 12 72 3 


Sequence 12723, A 


720 


5 


62 . 


5 


392 


12 


US- 10-42 4-5 99-2 4 6958 


Sequence 


246958, 


721 


5 


62 . 


5 


393 


12 


US-10-2 82-122A-7150 6 


Sequence 


71506, A 


722 


5 


62 . 


5 


394 


12 


US- 10-2 82-122A-7 0732 


Sequence 


70732, A 


723 


5 


62 . 


5 


397 


12 


US-10-2 82-122A-69113 


Sequence 


69113, A 


724 


5 


62 . 


5 


398 


14 


US-10-166-225A-74 


Sequence 


74, Appl 


725 


5 


62 . 


5 


399 


12 


US- 1 0-2 82- 12 2A- 6165 4 


Sequence 


61654, A 


726 


5 


62 . 


, 5 


399 


12 


US-10-2 8 2-122A-63635 


Sequence 


63635, A 


727 


5 


62 . 


, 5 


399 


12 


US-10-2 82-122A-7 652 4 


Sequence 


76524, A 


728 


5 


62 . 


, 5 


402 


9 


US-09-712-363-147 


Sequence : 


147, App 


729 


5 


62 . 


, 5 


402 


12 


US-10-2 82-122A-62 77 3 


Sequence 


62773, A 


730 


5 


62 , 


. 5 


402 


12 


US-10-2 8 2-122A-64 32 6 


Sequence 


64326, A 


731 


5 


62 . 


. 5 


406 


12 


US- 1 0-2 82- 12 2A- 5 94 66 


Sequence 


59466, A 


732 


5 


62 . 


. 5 


410 


14 


US- 10- 15 6-7 6 1-82 5 8 


Sequence 


8258, Ap 


733 


5 


62 . 


. 5 


411 


14 


US- 10-2 38-07 5-1102 


Sequence 


1102, Ap 


734 


5 


62 . 


.5 


417 


12 


US- 10-425- 114-55 98 9 


Sequence 


55989, A 


735 


5 


62 . 


. 5 


418 


12 


US-10-2 82-122A-7 64 97 


Sequence 


76497, A 


736 


5 


62 . 


. 5 


422 


15 


US- 10-2 64-2 37-2 090 


Sequence 


2090, Ap 


737 


5 


62 , 


. 5 


423 


12 


US- 10-42 4 -5 99-2 4 6959 


Sequence 


246959, 


738 


5 


62 , 


. 5 


423 


12 


US- 10- 4 25-1 14-4 97 43 


Sequence 


49743, A 


739 


5 


62 


.5 


424 


10 


US- 0 9- 9 33 -7 67 -107 9 


Sequence 


1079, Ap 


740 


5 


62 


.5 


424 


14 


US-10-023-282-1079 


Sequence 


1079, Ap 



741 


5 


62 . 


5 


426 


12 


US-10-425-114-57105 


Sequence 


57105, A 


742 


5 


62 . 


5 


427 


15 


US- 10- 3 69- 4 93- 18 68 8 


Sequence 


18688, A 


743 


5 


62 . 


5 


428 


15 


US- 10-3 69-4 93-2 0075 


Sequence 


20075, A 


744 


5 


62 . 


5 


442 


12 


US-10-2 82-122A-64111 


Sequence 


64111, A 


745 


5 


62 . 


5 


443 


14 


US-10-15 6-7 61-13 902 


Sequence 


13902, A 


746 


5 


62 . 


5 


446 


9 


US-09-824-735-2 


Sequence 2 , Appli 


747 


5 


62 . 


5 


448 


9 


US-09-8 15-242-12 010 


Sequence 12010, A 


74 8 


5 


62 . 


5 


448 


12 


US- 10-2 82 -122A- 66652 


Sequence 


66652, A 


74 9 


5 


62 . 


5 


452 


9 


US-09-865-415-6 


Sequence 6, Appli 


750 


5 


62 . 


5 


452 


15 


US-10-376-947-8 


Sequence 


8, Appli 


751 


5 


62 . 


5 


457 


15 


US- 10-369-4 9 3-630 8 


Sequence 


6308, Ap 


752 


5 


62 . 


5 


460 


12 


US- 10- 4 17 -7 0 OA- 137 


Sequence 


137, App 


753 


5 

**** 


62 . 


5 


4 62 


15 


US-10-369-4 93-2 32 8 8 


Sequence 


23288, A 


7 54 


5 


62 . 


5 


463 


14 


US-10-156-7 61-7 5 93 


Sequence 


7593, Ap 


7 5 5 


5 


62 . 


5 


464 


12 


US- 10- 4 2 4-599-2 0 668 4 


Sequence 


206684, 


7 5 6 


5 


62 . 


5 


471 


15 


US-10-10 8-2 60A-258 8 


Sequence 


2588, Ap 


757 


5 


62 . 


5 


479 


15 


US- 10-3 69- 4 9 3-4 62 2 


Sequence 


4622, Ap 


7 58 

/ w^ w 


5 


62 . 


5 


491 


15 


US- 10- 3 69- 4 93-7 38 0 


Sequence 


7 3 8 0, Ap 


7 5 9 


5 


62 . 


5 


497 


12 


US-10-2 82-122A-552 61 


Sequence 


55261, A 


7 fin 

/WW 


5 


62 . 


5 


506 

w^ \y \^ 


12 


US-10-424-5 99-14 4132 


Sequence 


144132, 


7 61 


5 


62 


5 


506 


15 


US- 10-369-4 93- 12 5 02 


Sequence 


12502, A 


7 6? 


5 


62 


5 


507 


12 


US- 10- 42 5- 114 -695 53 


Sequence 


69553, A 


7 63 

/ W o 


5 


62 


5 


507 


14 


US- 10- 156-7 61-12772 


Sequence 


12772, A 


7 64 




62 


5 


509 


12 


US-10-424-5 99-22 94 8 8 


Sequence 


229488, 


7 fi 5 


5 


62 


5 


519 


14 


US-10-113-7 94A-2 


Sequence 


2, Appli 


7 66 


5 


62 


5 


519 


15 


US-10-428-487-14 


Sequence 


14, Appl 


7 67 

/ W / 


5 


62 . 


5 


529 


15 


US-1 0-320-7 97-322 9 


Sequence 


3229, Ap 


7 68 

/WW 


5 


62 . 


5 


532 


9 


US-0 9-8 15-2 42-11727 


Sequence 11727, A 


7 fi Q 




62 


5 


533 


12 


US-10-2 82-122A-52 732 


Sequence 


52732, A 


7 7 0 


5 


62 . 


5 


533 


15 


US-1 0-3 69-4 93-19137 


Sequence 


19137, A 


111 


5 


62 


5 


535 


12 


US-10-2 82-122A-51874 


Sequence 


51874, A 


11? 


5 


62 


5 


537 


15 


US- 10- 3 6 9- 4 9 3- 8 02 4 


Sequence 


8024, Ap 


7 7 3 


5 


62 . 


5 


545 


12 


US-10-424-5 99-2 4 6964 


Sequence 


246964, 


11 A 


5 


62 


5 


551 


14 


US- 10- 02 9-3 8 6-338 57 


Sequence 


33857, A 


77 5 


5 


62 


5 


567 


15 


US-1 0-1 08-2 60A-4 022 


Sequence 


4022, Ap 


116 


5 


62 . 


5 


571 


12 


US- 10- 112- 94 4-3 64 


Sequence 


364, App 


111 


5 


62 . 


5 


579 


15 


US- 10-3 69-4 93-2 1556 


Sequence 


21556, A 


118 


5 


62 . 


5 


580 


15 


US- 10-369-4 93- 132 8 


Sequence 


1328, Ap 


7 7 9 


5 


62 . 


5 


581 


14 


US- 10-156-7 61-9697 


Sequence 


9697, Ap 


7 8 0 


5 


62 . 


5 


591 


9 


US-09-8 15-2 42-10419 


Sequence '. 


10419, A 


781 


5 


62 . 


5 


591 


12 


US-10-2 82-122A-42 68 9 


Sequence 


42689, A 


7 82 

/ \J Li 


5 


62 . 


5 


591 


14 


US-10-287-274-425 


Sequence 


425, App 


783 


5 


62 . 


, 5 


591 


15 


US-1 0-1 0 8-2 60A- 3 97 0 


Sequence 


3970, Ap 


7 84 


5 


62 . 


. 5 


597 


12 


US-10-2 82-122A-7 68 83 


Sequence 


76883, A 


7 85 


5 


62 . 


, 5 


599 


12 


US- 10-424-5 99-2 667 97 


Sequence 


266797, 


7 8 6 


5 


62 . 


, 5 


600 


12 


US- 10-425- 114-55322 


Sequence 


55322, A 


787 


5 


62 . 


. 5 


606 


14 


US-10-12 8-714-3 0 64 


Sequence 


3064, Ap 


7 8 8 


5 


62 . 


. 5 


607 


9 


US-0 9-8 15-2 42-137 91 


Sequence 


13791, A 


7 8 9 


5 


62 . 


. 5 


607 


12 


US-10-2 82-122A-56002 


Sequence 


56002, A 


790 


5 


62 . 


.5 


607 


12 


US-10-2 82-122A-594 7 4 


Sequence 


59474, A 


791 


5 


62 . 


. 5 


607 


12 


US- 10 — 2 8 2- 122A- 7 5 02 8 


Sequence 


75028, A 


792 


5 


62 . 


. 5 


607 


12 


US-10-2 82-122A-7 6059 


Sequence 


76059, A 


793 


5 


62 . 


. 5 


607 


12 


US-10-2 82-122A-7 8 019 


Sequence 


78019, A 


794 


5 


62 . 


. 5 


627 


14 


US-10-153-668-136 


Sequence 


136, App 


795 


5 


62 , 


. 5 


627 


14 


US-10-128-714-3118 


Sequence 


3118, Ap 


796 


5 


62 , 


,5 


637 


12 


US-10-344-741-22 


Sequence 


22, Appl 


797 


5 


62 , 


.5 


639 


12 


US- 1 0-2 8 2- 12 2A- 633 4 6 


Sequence 


63346, A 



7 9ft 


s 


62 

\J Z_« » 


R 

* ' 


64 0 


10 


US- 


-09-94 9-02 9-106 


Sequence 


106, App 




s 


vJ Z_« * 


R 
-_v 


670 


12 


us- 


-10-42 4-5 9 9-225919 


Sequence 


225919, 


ROD 
o \j \j 


s 


62 

\J Zl. • 


R 


676 


15 


us- 


-10-369-4 93-2240 


Sequence 


2240, Ap 


ft n 1 

O VJ X 




69 


R 


68 0 

U U v 


14 


us- 


-10-32 3-02 OA- 2 


Sequence 


2, Appli 


ft n ? 


r 


62 


R 


693 


9 


US- 1 


39-752-639-154 


Sequence 154, App 


ft n "3 

quo 




62 


R 


693 


9 


US — 1 


39-984-198-154 


Sequence 154, App 


ft n 4 

Q U 4 


r 


62 


R 
— ' 


693 


12 

_iw z_» 


us- 


-10-424-599-268383 


Sequence 


268383, 


r n r 

O U O 


r 

-J 


62 


R 


693 


12 


us- 


-10-425-114-45527 


Sequence 


45527, A 


ft 0 

DUD 


R 


O • 


R 


69 S 


1 s 




X'J T Li / C_i \J \J 


Sequence 


28, Appl 


ft 07 


s 

vj 


69 


R 
•J 


7 02 


1 2 


US- 


-10-2 82-122A-4 734 6 


Sequence 


47346, A 


ft o ft 


r 


62 

VJ J^. • 


R 


7 09 


12 


US- 


-10-424-599-268381 


Sequence 


268381, 


ft o 9 

O U Z7 


R 


62 


R 


71 0 


9 


US— I 


39-97 0-711-16 


Sequence 16, Appl 


pi n 

OIU 


R 


o z • 


R 
> 


71 4 


1 2 


US- 


-1 0-282-1 22A-52138 


Sequence 


52138, A 


ft 1 1 

D X X 


R 


(^9 
oz • 


R 


7 9 9 


1 s 




- 1 0-369-493-4138 


Sequence 


413 8, Ap 


ft 1 9 


R 


uz. • 


R 
»J 


7 ? n 


1 4 


US- 


-1 0-1 S3-668-4 62 


Sequence 


4 62 , App 


ft 1 "3 


r 




R 


7 39 


g 


US — 1 


09-738-62 6-6988 


Sequence 6988, Ap 


ft 1 4 
o J. *± 


r 


£9 

Oil. • 


R 
-J 


74 6 


1 2 


us- 


-1 0-425-1 14-64987 

l v/ T. ZU *J _L. ^x VJ ^ — ' vj / 


Sequence 


64987, A 


ft 1 R 


R 


f^9 
o z • 


R. 


7 R 9 


1 2 


'TIS- 


_ i 0-424-S99-192318 


Sequence 


192318, 


O X O 


r 


£9 
OZ . 


R 


7^9 


1 4 


TTC1- 
O o 


_ i o_i 9ft-71 4-ftl 1 ft 

XVJ X/1.1J /XT sJXX*J 


Sequence 


8118, Ap 


ft 1 7 
OX/ 


R 


^ 9 
O Z . 


R 




1 2 


nq. 


_ i o-9ft9-1 ?2 A— 4 8 639 


Sequence 


48639, A 


ft 1 ft 




{^9 
oz • 


R 


7 ft ft 


1 9 


tiq. 


-1 0-282-1 2 2A-73796 


Sequence 


73796, A 


O X 


R 


f^9 
oz . 


R 


7 9 S 


1 2 

-L Z. 




-1 0-425-1 1 4-70962 

_L vJ z_» X J. ^ / vj _J VJ Zl- 


Sequence 


70962, A 


ft 9 n 


R 


(^9 

O • 


R 


7 96 
/ _? o 


1 4 


US- 


-1 0-032-S8 5-72 01 

J_ V> VJ ZL, ^JT VJ ^ z_l vJ _l_ 


Sequence 


7201, Ap 


Q 9 1 
Oil 


O 


^9 
o z . 


R 


ft n f=; 

DUO 


1 9 
x z 




_09 — ft ^ ^-4 fi6— 1 3 

VJw? O O *i\J\J XJ 


Sequence 


13, Appl 


ft 9 9 


R 

•O 


f^9 
o z • 


R 


ft n 6 

u u u 


1 4 


US 


-1 0-1 9 9-8 69-6 


Sequence 


6, Appli 


ft 9 "3 


R 


£9 
o z . 


R 


ft 1 S 


1 9 

X i 


US- 


-1 0 — 282-12 2 A- 668 81 

J_ vj Z_i VJ ZL, _Lr Z_, Z_- jTv VJ VJ VJ VJ J_ 


Sequence 


66881, A 


ft 9 4 
O Z fl 


r 


£9 
o z • 


R 


ft 1 9 

(J A. J 


1 9 


US 

iJ 


_X0-282-12 2 A- 7 7 8 57 

J_ VJ VJ Z_rf J_ Z_* ZL. J L / I VJ ^ / 


Sequence 


77857, A 


ft 9 R 
o Z O 


R 


£9 
o z • 


R 


ft 1 9 


1 4 

X ^ 


us 


-1 0-1 28-714-3429 

_1_ VJ J_ Zl* VJ J _t X J X ^ 


Sequence 


3429, Ap 


ft 9 £ 
O Z O 


R 


f^9 
oz . 


R 


ft 9 S 


1 4 
x 


US- 


_-| 0-1 98-71 4-8429 


Sequence 


8429, Ap 


ft 9 7 
O Z / 


R 


f^9 
oz • 


R 


ft 9 7 


1 4 

X 


US- 


-10-101-4 64A-915 

A. VJ _l_ VJ J_ " \J x x!l J X. 


Sequence 


915, App 


ft 9 ft 
O Z O 


R 


^9 


R 


ft ? 3 


1 R 

X J 


US' 


-1 0-369-4 93-2464 

_i. Vj — j vj w/ x s-J ' z_ x vj x 


Sequence 


2464, Ap 


ft 9 Q 

O Z J 


R 


69 
o z • 


R 


ft ^ ft 

{J J (J 


1 2 

X i 


us 


-10-17 7-68 7-4 

_1_ VJ J_ f / V_V VJ / X 


Sequence 


4, Appli 


ft "3 n 

O J u 


R 

O 


£9 
o z . 


R 


ft 4 1 

(J *i J_ 


Q 
_/ 


US — 

VJ kJ 


09-ft6i-4R 1A-3 0 

VJ wJ VJ VJ J_ x kj _1_ x V VJ 


Sequence 30 , Appl 


o 0 X 




f^9 
o z * 


R 


ft 4 S 


1 s 

X J 


US 

VJ o 


-1 0-369-4 93-1487 

J_ W J VJ ^J —J ± T U / 


Sequence 


1487, Ap 


P ^ 9 
0 OZ 


R 


^9 
oz . 


R 


ft 4 7 


1 4 

X " 


US 

VJ iJ 


-1 0-128-7 14-8064 

XVJ X^iU rX^ \J \J \J ^ 


Sequence 


8064, Ap 


ft "3 "3 

O J J 


R 


f^9 
o z . 


R 
> 


ft ft2 

(J U i 


1 2 


us 

>J kJ 


-1 0-177-687-2 

_1_ VJ +Lr 1 f VJ VJ / ZL. 


Sequence 


2, Appli 


ft 1 4 

O O 1 


R 
-J 


69 


R 


99 3 


12 


us 


-10-114-27 0-152 

-1. VJ _L_ _i_ ^X / VJ _l_ ^ £_» 


Sequence 


152, App 


ft "3 S 

O J J 


R 

>J 


69 


R 
-J 


930 


14 


us 


-10-113-7 9 4A-1 

-I. VJ -X_ _L, -s_J ' —J X J J- 


Sequence 


1, Appli 


ft "3 £ 
D jo 


R 


f^9 
o z . 


R 


9^1 


1 4 

X ^ 


us 

VJ kJ 


-1 0-032-585-7545 

1 VJ VJ «^J Zl* VJ m^J 1 ^J T tj 


Sequence 


7545, Ap 


ft ^7 
DO/ 


R 


62 

vj zl • 


R 


933 


1 2 


us 


-10-415-934-3 

VJ ^X _i_ *J — / ^X ^J 1 


Sequence 


3, Appli 


ft ^ft 
O JO 


R 


62 


R 


956 

w 


12 


us 


-10-2 82-122A-51168 


Sequence 


51168, A 


ft ^9 


R 

-J 


62 


5 


972 

— ^ I z^ 


12 


us 


-10-424-59 9-188385 


Sequence 


188385, 


ft 4 n 

o *± vj 


R 
-J 


62 


5 


972 


12 


us 


-10-42 5-114-38933 


Sequence 


38933, A 


ft 4 1 

O *± X 


R 


62 


R 


97 4 


15 


us 


-10-094-466-20 

_1_ VJ VJ — X X X VJ VJ VJ 


Sequence 


20, Appl 


ft 4 9 


R 
_j 


62 


5 

— / 


976 


15 


us 

^J/ 


-10-369-4 93-16784 


Sequence 


16784, A 


ft 4 3 

O 1 J 


R 


62 


5 


979 


12 


us 


-10-2 82-122A-47372 


Sequence 


47372, A 


ft 4 4 


R 
— > 


62 


5 


984 


15 


us 


-10-369-493-20751 


Sequence 


20751, A 


ft 4 S 

U T J 


R 
-j 


62 


5 


998 


15 


us 


-10-369-4 9 3-13474 


Sequence 


13474, A 


ft 4 £ 

0 4 0 


R 


69 


5 


1014 

_i_ W _i_ A 


15 


us 


-10-320-7 97-3327 


Sequence 


3327, Ap 


847 


5 


62 . 


5 


1049 


12 


us 


-10-211-462-6 


Sequence 


6, Appli 


848 


5 


62 . 


5 


1049 


15 


us 


-10-236-031B-52 


Sequence 


52, Appl 


849 


5 


62. 


5 


1049 


15 


us 


-10-295-027-4 


Sequence 


4, Appli 


850 


5 


62 . 


5 


1049 


15 


us 


-10-295-027-726 


Sequence 


726, App 


851 


5 


62 . 


5 


1049 


15 


us 


-10-295-027-1166 


Sequence 


1166, Ap 


852 


5 


62 . 


5 


1049 


15 


us 


-10-301-330-2 


Sequence 


2, Appli 


853 


5 


62. 


5 


1059 


15 


us 


-10-369-493-12522 


Sequence 


12522, A 


854 


5 


62. 


5 


1063 


12 


us 


-10-425-114-39245 


Sequence 


39245, A 



855 

*S 


5 


62 . 


5 


1066 


15 


US- 10-369-4 93-2 0392 


Sequence 20392, A 


8 56 

U -J V 


5 


62 . 


5 


1083 


10 


US-09-835-976B-2 


Sequence 2, Appli 


857 


5 


62 . 


5 


1090 


12 


US-10-276-774-2321 


Sequence 2321, Ap 


8 5 8 

U J U 


5 


62 


5 


1120 


15 


US-1 0-3 69-4 9 3-1 3166 


Sequence 13166, A 


8 ■=> 9 

\J k_J 


5 


62 


5 


1132 


15 


US-10-2 89-7 62-4 66 


Sequence 466, App 


8 60 

Vw/ w 


5 


62 . 


5 


1167 


15 


US-10-3 69-4 93-1192 9 


Sequence 11929, A 


8 61 

\J w J— 


5 


62 . 


5 


1178 


12 


US-10-2 82-122A-72168 


Sequence 72168, A 


8 62 


5 


62 . 


5 


1178 


12 


US-10-2 82-122A-7 644 9 


Sequence 76449, A 


8 63 

i-* \J 


5 


62 . 


5 


1181 


14 


US-10-199-937-139 


Sequence 139, App 


8 64 


5 


62 


5 


1221 


12 


US-10-2 8 2-122A-4 8 35 0 


Sequence 48350, A 


8 65 

\J \J 


5 


62 


5 


1257 


14 


US- 10- 15 6-7 61-12 8 33 


Sequence 12833, A 


8 66 

W \J \J 


5 


62 


5 


1397 


15 


US- 10-3 69-4 93-24 13 


Sequence 2413, Ap 


8 67 


5 


62 


5 


1407 


12 


US-10-2 82 -12 2A- 54 67 8 


Sequence 54678, A 

A 9 


8 68 
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ALIGNMENTS 



RESULT 1 
US-09-896-874-8 

; Sequence 8, Application US/09896874 
; Patent No. US200200 16320A1 
; GENERAL INFORMATION: 
; APPLICANT: Fang, Lawrence Y. 
APPLICANT: John, Varghese 

TITLE OF INVENTION: COMPOUNDS TO TREAT ALZHEIMER'S DISEASE 
; FILE REFERENCE: 13615. 40USU1 

; CURRENT APPLICATION NUMBER: US/09/896,874 
CURRENT FILING DATE: 2001-06-29 
PRIOR APPLICATION NUMBER: US 60/215,323 
PRIOR FILING DATE: 2000-06-30 
; NUMBER OF SEQ ID NOS : 9 
; SOFTWARE: Patentln version 3.1 
; SEQ ID NO 8 
LENGTH: 9 
TYPE: PRT 

ORGANISM: Artificial Sequence 
; FEATURE: 



; OTHER INFORMATION: Synthetic 
US-09-896-874-8 



Query Match 100.0%; Score 8; DB 9 ; Length 9; 

Best Local Similarity 100.0%; Pred. No. 9.6e+05; 

Matches 8; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 

Qy 1 EWLDAEF 8 

I I I I I I I I 
Db 2 EVNLDAEF 9 



RESULT 2 
US-09-896-139-8 

Sequence 8, Application US/09896139 
Patent No. US20020128255A1 
GENERAL INFORMATION: 



APPLICANT 
APPLICANT 
APPLICANT 
APPLICANT 
APPLICANT 
APPLICANT 
APPLICANT 
APPLICANT 
APPLICANT 
APPLICANT 
TITLE OF 



Beck, James P. 
Fang, Lawrence Y. 
Freskos, John N. 
Gailunas, Andrea 
Horn, Roy 

Jagodizinska , Barbara 
John, Varghese 
Maillaird, Michel 
Pulley, Shon R. 
TenBrink, Ruth E. 
INVENTION: COMPOUNDS TO TREAT ALZHEIMER'S DISEASE 



FILE REFERENCE: 13615. 25USU4 

CURRENT APPLICATION NUMBER: US/ 0 9/ 8 9 6 , 13 9 
CURRENT FILING DATE: 2001-06-29 



PRIOR APPLICATION 
PRIOR FILING DATE 
PRIOR APPLICATION 
PRIOR FILING DATE 



NUMBER: US 60/215,323 

2000-06-30 
NUMBER: US 60/252,736 
2000-11-22 
PRIOR APPLICATION NUMBER: US 60/255,956 
PRIOR FILING DATE: 2000-12-15 
PRIOR APPLICATION NUMBER: US 60/268,497 
PRIOR FILING DATE: 2001-02-13 
PRIOR APPLICATION NUMBER: US 60/279,779 
PRIOR FILING DATE: 2001-03-29 
PRIOR APPLICATION NUMBER: US 60/295,589 
PRIOR FILING DATE: 2001-06-04 
NUMBER OF SEQ ID NOS : 9 
SOFTWARE: Patentln version 3.1 
SEQ ID NO 8 
LENGTH: 9 
TYPE: PRT 

ORGANISM: Artificial Sequence 
FEATURE: 

OTHER INFORMATION: Synthetic 
US-09-896-139-8 



Query Match 100.0%; Score 8; DB 9; Length 9; 

Best Local Similarity 100.0%; Pred. No. 9.6e+05; 
Matches 8; Conservative 0; Mismatches 0; Indels 



0; Gaps 



0; 



Qy 1 EVNLDAEF 8 

I I I I I I I I 
Db 2 EVNLDAEF 9 



RESULT 3 
US-09-895-843-8 

Sequence 8, Application US/09895843 
Patent No. US 2 002 014 317 7A1 
GENERAL INFORMATION: 



APPLICANT 
APPLICANT 
APPLICANT 
APPLICANT 
APPLICANT 
APPLICANT 
APPLICANT 
APPLICANT 
APPLICANT 
APPLICANT 



Beck, James P. 
Fang, Lawrence Y. 
Freskos, John N. 
Gailunas, Andrea 
Horn, Roy 

Jagodizins ka, Barbara 
John, Varghese 
Maillaird, Michel 
Pulley, Shon R. 
TenBrink, Ruth E. 
TITLE OF INVENTION: COMPOUNDS TO TREAT ALZHEIMER 1 S DISEASE 
FILE REFERENCE: 13615. 41USU1 

CURRENT APPLICATION NUMBER: US/ 09/895, 843 
CURRENT FILING DATE: 2001-06-29 
PRIOR APPLICATION NUMBER: US 60/215,323 
PRIOR FILING DATE: 2000-06-30 
NUMBER OF SEQ ID NOS : 9 
SOFTWARE: Patentln version 3.1 
SEQ ID NO 8 
LENGTH: 9 
TYPE : PRT 

ORGANISM: Artificial Sequence 
FEATURE : 

OTHER INFORMATION: Synthetic 
US-09-895-843-8 

Query Match 100.0%; Score 8; DB 9; Length 9; 

Best Local Similarity 100.0%; Pred. No. 9.6e+05; 

Matches 8; Conservative 0; Mismatches 0; Indels 

Qy 1 EVNLDAEF 8 

I I M I I I I 

Db 2 EVNLDAEF 9 



RESULT 4 
US-09-895-871-8 

; Sequence 8, Application US/09895871 

; Publication No. US 2 003 009 68 64A1 

; GENERAL INFORMATION: 

; APPLICANT: Fang, Lawrence Y. 

; APPLICANT: Horn, Roy 

; APPLICANT: John, Varghese 

; APPLICANT: Maillaird, Michel 

TITLE OF INVENTION: COMPOUNDS TO TREAT ALZHEIMER 1 S DISEASE 

FILE REFERENCE: 13615. 21USU1 
; CURRENT APPLICATION NUMBER: US/ 0 9/ 8 95 , 87 1 



CURRENT FILING DATE: 2001-06-29 
PRIOR APPLICATION NUMBER : US 60/215,323 
PRIOR FILING DATE: 2000-06-30 
NUMBER OF SEQ ID NOS : 9 
SOFTWARE: Patentln version 3.1 
; SEQ ID NO 8 

LENGTH : 9 

TYPE : PRT 
; ORGANISM: Artificial Sequence 

FEATURE : 

OTHER INFORMATION: Synthetic 
US-09-895-871-8 

Query Match 100.0%; Score 8; DB 10; Length 9; 

Best Local Similarity 100.0%; Pred. No. 9.6e+05; 
Matches 8; Conservative 0; Mismatches 0; Inclels 

Qy 1 EVNLDAEF 8 

I I I II I I I 
Db 2 EVNLDAEF 9 



RESULT 5 
US-10-299-739-8 

Sequence 8, Application US/10299739 
Publication No. US20040039064A1 
GENERAL INFORMATION: 
APPLICANT: Elan Pharmaceuticals, Inc. 
APPLICANT: Pharmacia & Upjohn Company 
APPLICANT: Romero, Arthur G. 
APPLICANT: Schostarez, Heinrich J. 
APPLICANT: Christina, McGrain M. 

TITLE OF INVENTION: Amine 1,2- and 1,3-Diol Compounds 
FILE REFERENCE: 01-1724-B 

CURRENT APPLICATION NUMBER: US/10/299,739 
CURRENT FILING DATE: 2003-11-19 
PRIOR APPLICATION NUMBER: 60/333,081 
PRIOR FILING DATE: 2001-11-19 
PRIOR APPLICATION NUMBER: 60/334,000 
PRIOR FILING DATE: 2001-11-18 
PRIOR APPLICATION NUMBER: 60/362,752 
PRIOR FILING DATE: 2002-03-08 
NUMBER OF SEQ ID NOS: 9 
SOFTWARE: Patentln version 3.2 
SEQ ID NO 8 
LENGTH: 9 
TYPE: PRT 

ORGANISM: Artificial Sequence 
FEATURE: 

OTHER INFORMATION: synthetic peptide 
US-10-299-739-8 

Query Match 100.0%; Score 8; DB 12; Length 9; 

Best Local Similarity 100.0%; Pred. No. 9.6e+05; 
Matches 8; Conservative 0; Mismatches 0; Indels 



Qy 



1 EVNLDAEF 8 



MINIM 

Db 2 EVNLDAEF 



RESULT 6 
US-10-192-625-8 

; Sequence 8, Application US/10192625 
; Publication No. US2 003 0 0 83353A1 
; GENERAL INFORMATION: 

; APPLICANT: Schostarez, Heinrich J. 
; APPLICANT: Chrusciel, Robert A 

; TITLE OF INVENTION: Diaminediols for the Treatment of Alzheimer's Disease 
FILE REFERENCE: 01-1645-B 

CURRENT APPLICATION NUMBER: US/ 1 0/ 192 , 62 5 

CURRENT FILING DATE: 2002-07-10 

PRIOR APPLICATION NUMBER: 60/304,305 
; PRIOR FILING DATE: 2001-07-10 
; PRIOR APPLICATION NUMBER: 60/334,480 

PRIOR FILING DATE: 2001-07-10 
; NUMBER OF SEQ ID NOS : 9 
; SOFTWARE: Patentln version 3.1 
; SEQ ID NO 8 
LENGTH: 9 
; TYPE: PRT 

; ORGANISM: Artificial Sequence 
. FEATURE : 

OTHER INFORMATION: synthetic peptide 
; FEATURE : 

; NAME/ KEY: MI S C_FEATURE 
; LOCATION: (1) . . (1) 

; OTHER INFORMATION: N-terminal biotin 
US-10-192-625-8 

Query Match 100.0%; Score 8; DB 14; Length 9; 

Best Local Similarity 100.0%; Pred. No. 9.6e+05; 

Matches 8; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 

Qy 1 EVNLDAEF 8 

M M M M 
Db 2 EVNLDAEF 9 



RESULT 7 
US-10-192-424-8 

; Sequence 8, Application US/10192424 
; Publication No. US2 003008 335 6A1 
; GENERAL INFORMATION: 

; APPLICANT: Schostarez, Heinrich J. 
; APPLICANT: Chrusciel, Robert A. 

; TITLE OF INVENTION: Statine Derivatives for the Treatment of Alzheimer's 
Disease 

; FILE REFERENCE: 02-154-A 

CURRENT APPLICATION NUMBER: US/ 1 0/ 1 92 , 42 4 
; CURRENT FILING DATE: 2002-07-10 

PRIOR APPLICATION NUMBER: 60/204,128 

PRIOR FILING DATE: 2001-07-10 
; PRIOR APPLICATION NUMBER: 60/327,424 



PRIOR FILING DATE: 2001-10-05 
NUMBER OF SEQ ID NOS : 9 
; SOFTWARE: Patentln version 3.1 
; SEQ ID NO 8 
; LENGTH: 9 
TYPE: PRT 
; ORGANISM: Artificial Sequence 
FEATURE: 

OTHER INFORMATION: synthetic peptide 
FEATURE : 

NAME/ KEY : MISC_FEATURE 

LOCATION: (1) . . (1) 
; OTHER INFORMATION: N-terminal biotin 
US-10-192-424-8 

Query Match 100.0%; Score 8; DB 14; Length 9; 

Best Local Similarity 100.0%; Pred. No. 9.6e+05; 

Matches 8; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 

Qy 1 EVNLDAEF 8 

I EE I I I I I 

Db 2 EVNLDAEF 9 



RESULT 8 

US-10-183-126A-8 

; Sequence 8, Application US/10183126A 
; Publication No. US 2 0 03008 35 18A1 
; GENERAL INFORMATION: 

; APPLICANT: Pharmacia and Upjohn Company 
; APPLICANT: John, Varghese 
; APPLICANT: Horn, Roy 

APPLICANT: Tucker, John 
; TITLE OF INVENTION: Substituted Alcohols Useful in Treatment of Alzheimer's 
Disease 

FILE REFERENCE: 01-1704-C 

CURRENT APPLICATION NUMBER: US / 1 0 / 1 8 3 , 12 6A 
; CURRENT FILING DATE: 2002-09-11 
; NUMBER OF SEQ ID NOS: 9 
; SOFTWARE: Patentln version 3.1 
; SEQ ID NO 8 

LENGTH : 9 

TYPE: PRT 
; ORGANISM: Artificial Sequence 

FEATURE : 

; OTHER INFORMATION: Synthetic oligopeptides substrate with cleavage site of 
beta-secr 

; OTHER INFORMATION: etase and with optionally detectable tag. 
US-10-183-126A-8 

Query Match 100.0%; Score 8; DB 14; Length 9; 

Best Local Similarity 100.0%; Pred. No. 9.6e+05; 

Matches 8; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 



Qy 1 EVNLDAEF 8 

I I I I I I I I 
Db 2 EVNLDAEF 9 



RESULT 9 
US-10-171-343-8 

; Sequence 8, Application US/10171343 
; Publication No. US20030092747A1 
; GENERAL INFORMATION: 

; APPLICANT: Schostarez, Heinrich J. 
; APPLICANT: Chrusciel, Rober A. 

; TITLE OF INVENTION: Aminediols for the Treatment of Alzheimer's Disease 

FILE REFERENCE: 01-1644-B 
; CURRENT APPLICATION NUMBER: US/ 1 0/ 17 1 , 34 3 

CURRENT FILING DATE: 2002-06-13 

PRIOR APPLICATION NUMBER: US 60/297,827 

PRIOR FILING DATE: 2001-06-13 

PRIOR APPLICATION NUMBER: US 60/333,084 

PRIOR FILING DATE: 2001-11-19 

NUMBER OF SEQ ID NOS : 9 
; SOFTWARE: Patentln version 3.1 
; SEQ ID NO 8 
; LENGTH : 9 
TYPE : PRT 
; ORGANISM: Artificial Sequence 
; FEATURE : 

; OTHER INFORMATION: synthetic peptide 
FEATURE : 

NAME/KEY : MI SC_FEATURE 
LOCATION: (1) . . (1) 

OTHER INFORMATION: N-terminal biotin 
US-10-171-343-8 

Query Match 100.0%; Score 8; DB 14; Length 9; 

Best Local Similarity 100.0%; Pred. No. 9.6e+05; 

Matches 8; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 

Qy 1 EVNLDAEF 8 

I I I I 1 I I I 
Db 2 EVNLDAEF 9 



RESULT 10 
US-10-264-707-8 

; Sequence 8, Application US/10264707 
; Publication No. US2 0 03012 53 65A1 
; GENERAL INFORMATION: 
; APPLICANT: Fisher, Jed F. 
; APPLICANT: Jacobs, Jon S. 
; APPLICANT: Scherer, Brian S. 
; TITLE OF INVENTION: Hydroxypropylamines 
FILE REFERENCE: 01-1555-B 

CURRENT APPLICATION NUMBER: US / 1 0 / 2 64 , 7 07 

CURRENT FILING DATE: 2002-10-04 
; PRIOR APPLICATION NUMBER: 60/327,149 
; PRIOR FILING DATE: 2001-10-04 

PRIOR APPLICATION NUMBER: 60/334,058 

PRIOR FILING DATE: 2001-11-28 
; NUMBER OF SEQ ID NOS: 9 



; SOFTWARE: Patentln version 3.1 
; SEQ ID NO 8 

LENGTH : 9 

TYPE: PRT 
; ORGANISM: Artificial Sequence 

FEATURE : 

OTHER INFORMATION: synthetic peptide 
; FEATURE : 

NAME/ KEY : MISC_FEATURE 
; LOCATION: (1) . . (1) 

OTHER INFORMATION: N-terminal biotin 
US-10-264-707-8 

Query Match 10 0.0%; Score 8; DB 14; Length 9; 

Best Local Similarity 100.0%; Pred. No. 9.6e+05; 

Matches 8; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 

Qy 1 EVNLDAEF 8 

I I I I I I I I 
Db 2 EVNLDAEF 9 



RESULT 11 
US-10-066-319-4 

; Sequence 4, Application US/10066319 

; Publication No. US2 0030 14 7 8 10A1 

; GENERAL INFORMATION: 

; APPLICANT: Ross, Brian D. 

; APPLICANT: Rehemtulla, Alnawaz 

TITLE OF INVENTION: COMPOSITIONS AND METHODS FOR REPORTING 

TITLE OF INVENTION: OF PROTEASE ACTIVITY WITHIN THE SECRETORY PATHWAY 

FILE REFERENCE: 11203-007001 

CURRENT APPLICATION NUMBER: US/ 10/ 066, 319 
CURRENT FILING DATE: 2002-06-17 
NUMBER OF SEQ ID NOS : 18 
; SOFTWARE: FastSEQ for Windows Version 4.0 
; SEQ ID NO 4 
LENGTH: 9 
TYPE: PRT 

ORGANISM: Homo sapiens 
US-10-066-319-4 

Query Match 100.0%; Score 8; DB 14; Length 9; 

Best Local Similarity 100.0%; Pred. No. 9.6e+05; 

Matches 8; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 

Qy 1 EVNLDAEF 8 

I i I I I I I I 
Db 2 EVNLDAEF 9 



RESULT 12 
US-10-337-075-8 

; Sequence 8, Application US/10337075 

; Publication No. US2 003 0 16658 0A1 

; GENERAL INFORMATION: 

; APPLICANT: Warpehoski, Martha A. 



; APPLICANT: Jagodzinska, Barbara 

; TITLE OF INVENTION: Substituted Amino Carboxamides for the Treatment of 
Alzheimer's Disease 

FILE REFERENCE: 01-1795-C 

CURRENT APPLICATION NUMBER: US/ 10/ 337 , 07 5 
; CURRENT FILING DATE: 2003-01-06 

PRIOR APPLICATION NUMBER: 60/345,316 
; PRIOR FILING DATE: 2002-01-04 
; PRIOR APPLICATION NUMBER: 60/350,419 

PRIOR FILING DATE: 2002-01-18 
; NUMBER OF SEQ ID NOS : 9 
; SOFTWARE: Patentln version 3.1 
; SEQ ID NO 8 
LENGTH : 9 
TYPE: PRT 
; ORGANISM: Artificial Sequence 
FEATURE : 

OTHER INFORMATION: synthetic peptide 

FEATURE : 
; NAME /KEY: MI S C_FEATURE 

LOCATION: (1) . . (1) 
; OTHER INFORMATION : N-terminal biotin 
US-10-337-075-8 

Query Match 100.0%; Score 8; DB 14; Length 9; 

Best Local Similarity 100.0%; Pred. No. 9.6e+05; 

Matches 8; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 

Qy 1 EVNLDAEF 8 

I I I I I I I I 
Db 2 EVNLDAEF 9 



RESULT 13 
US-10-160-777-8 

Sequence 8, Application US/10160777 
Publication No. US2 003 01667 17A1 
GENERAL INFORMATION: 



APPLICANT 
APPLICANT 
APPLICANT 
APPLICANT 
APPLICANT 
APPLICANT 



Freskos, John 
Brown, David L. 
Fobian, Yvette M. 
Fang, Larry 
Romero, Arthur G. 
Varghese, John 
TITLE OF INVENTION: Hydroxy Alkyl Amines 
FILE REFERENCE: 01-1632-C 

CURRENT APPLICATION NUMBER: US/ 1 0/ 1 60 , 7 7 7 
CURRENT FILING DATE: 2002-10-15 
PRIOR APPLICATION NUMBER: 60/343,772 
PRIOR FILING DATE: 2001-12-28 
PRIOR APPLICATION NUMBER: 60/332,639 
PRIOR FILING DATE: 2001-11-19 
PRIOR APPLICATION NUMBER: 60/295,332 
PRIOR FILING DATE: 2001-06-01 
NUMBER OF SEQ ID NOS: 9 
SOFTWARE: Patentln version 3.1 
SEQ ID NO 8 



; LENGTH: 9 

TYPE: PRT 
; ORGANISM: Artificial sequence 

FEATURE: 

; OTHER INFORMATION: synthetic peptide 
FEATURE : 

NAME/ KEY: MI S C_FEATURE 
LOCATION: (1) . . (1) 

OTHER INFORMATION: N-terminal biotin 
US-10-160-777-8 

Query Match 100.0%; Score 8; DB 14; Length 9; 

Best Local Similarity 100.0%; Pred. No. 9.6e+05; 

Matches 8; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 

Qy 1 EVNLDAEF 8 

I I I I I 1 I I 
Db 2 EVNLDAEF 9 



RESULT 14 
US-10-192-543-8 

; Sequence 8, Application US/10192543 
; Publication No. US2 004 0 0190 8 6Al 
; GENERAL INFORMATION: 

; APPLICANT: Schostarez, Heinrich J. 
; APPLICANT: Chrusciel, Robert A. 

; TITLE OF INVENTION: Aminediols for the Treatment of Alzheimer's Disease 

FILE REFERENCE: 02-156-A 
; CURRENT APPLICATION NUMBER: US/ 1 0/ 192 , 543 
; CURRENT FILING DATE: 2002-07-10 

PRIOR APPLICATION NUMBER: 60/304,129 

PRIOR FILING DATE: 2001-07-10 

NUMBER OF SEQ ID NOS : 9 
; SOFTWARE: Patentln version 3.1 
; SEQ ID NO 8 
; LENGTH: 9 
TYPE: PRT 
; ORGANISM: Artificial Sequence 
FEATURE : 

; OTHER INFORMATION: synthetic peptide 
FEATURE: 

NAME/ KEY: MISC_FEATURE 
LOCATION: (1) . . (1) 

OTHER INFORMATION: N-terminal biotin 
US-10-192-543-8 

Query Match 100.0%; Score 8; DB 15; Length 9; 

Best Local Similarity 100.0%; Pred. No. 9.6e+05; 

Matches 8; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 

Qy 1 EVNLDAEF 8 

MINIM 
Db 2 EVNLDAEF 9 



RESULT 15 



US-09-794-927-63 

Sequence 63, Application US/09794927 
Patent No. US2 0 01 00 16324A1 
GENERAL INFORMATION: 
APPLICANT: Gurney, Mark E. 
APPLICANT: Bienkowski, Michael J. 
APPLICANT: Heinrikson, Robert L. 
APPLICANT: Parodi, Luis A. 
APPLICANT: Yan, Riqiang 

TITLE OF INVENTION: ALZHEIMER'S DISEASE SECRETASE, APP SUBSTRATES THEREFOR, 
AND 

TITLE OF INVENTION: USES 
TITLE OF INVENTION: THEREFOR 
FILE REFERENCE: 28341/6280FG 
CURRENT APPLICATION NUMBER: US/ 0 9/ 7 9 4 , 92 7 
CURRENT FILING DATE: 2001-02-27 
PRIOR APPLICATION NUMBER: 09/416 r 901 
PRIOR FILING DATE: 1999-10-13 
PRIOR APPLICATION NUMBER: 60/155,493 
PRIOR FILING DATE: 1999-09-23 
PRIOR APPLICATION NUMBER: 09/404,133 
PRIOR FILING DATE: 1999-09-23 
PRIOR APPLICATION NUMBER: PCT/US 9 9 / 2 0 8 8 1 
PRIOR FILING DATE: 1999-09-23 
PRIOR APPLICATION NUMBER: 60/101,594 
PRIOR FILING DATE: 1998-09-24 
NUMBER OF SEQ ID NOS : 73 
SOFTWARE: Patentln Ver. 2.0 
SEQ ID NO 63 
LENGTH: 10 
TYPE: PRT 

ORGANISM: Artificial Sequence 
FEATURE : 

OTHER INFORMATION : Description of Artificial Sequence: synthetic 
US-09-794-927-63 

Query Match 100.0%; Score 8; DB 9; Length 10; 

Best Local Similarity 100.0%; Pred. No. 0.018; 

Matches 8; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 

Qy 1 EVNLDAEF 8 

I I I I I I I I 
Db 2 EVNLDAEF 9 



RESULT 16 
US-09-795-847-63 

Sequence 63, Application US/09795847 
Patent No. US2 00100182 08A1 
GENERAL INFORMATION: 



Gurney, Mark E. 
Bienkowski, Michael J. 
Heinrikson, Robert L. 
Parodi, Luis A. 
Yan, Riqiang 



APPLICANT 
APPLICANT 
APPLICANT 
APPLICANT 
APPLICANT 

TITLE OF INVENTION: ALZHEIMER'S DISEASE SECRETASE, APP SUBSTRATES THEREFOR, 



AND 



TITLE OF INVENTION: USES 
TITLE OF INVENTION: THEREFOR 
FILE REFERENCE: 28341/6280DE 
CURRENT APPLICATION NUMBER: US/ 09/795, 847 
CURRENT FILING DATE: 2001-02-28 
PRIOR APPLICATION NUMBER: 09/416,901 
PRIOR FILING DATE: 1999-10-13 
PRIOR APPLICATION NUMBER: 60/155,493 
PRIOR FILING DATE: 1999-09-23 
PRIOR APPLICATION NUMBER: 09/404,133 
PRIOR FILING DATE: 1999-09-23 
PRIOR APPLICATION NUMBER: PCT/US 9 9/ 2 0 8 8 1 
PRIOR FILING DATE: 1999-09-23 
PRIOR APPLICATION NUMBER: 60/101,594 
PRIOR FILING DATE: 1998-09-24 
NUMBER OF SEQ ID NOS : 73 
SOFTWARE: Patentln Ver. 2.0 
SEQ ID NO 63 
LENGTH: 10 
TYPE: PRT 

ORGANISM: Artificial Sequence 
FEATURE : 

OTHER INFORMATION: Description of Artificial Sequence: synthetic 
US-09-795-847-63 

Query Match 100.0%; Score 8; DB 9; Length 10; 

Best Local Similarity 100.0%; Pred. No. 0.018; 

Matches 8; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 

Qy 1 EVNLDAEF 8 

IlllCIti 

Db 2 EVNLDAEF 9 



RESULT 17 
US-09-794-743-63 

Sequence 63, Application US/09794743 
Patent No. US2 00 1002 139 1A1 
GENERAL INFORMATION: 
APPLICANT: Gurney, Mark E. 
APPLICANT: Bienkowski, Michael J. 
APPLICANT: Heinrikson, Robert L. 
APPLICANT: Parodi, Luis A. 
APPLICANT: Yan, Riqiang 

TITLE OF INVENTION: ALZHEIMER'S DISEASE SECRETASE, APP SUBSTRATES THEREFOR, 
AND 

TITLE OF INVENTION: USES 
TITLE OF INVENTION: THEREFOR 
FILE REFERENCE: 28341/6280BC 
CURRENT APPLICATION NUMBER: US/ 0 9 / 7 9 4 , 7 4 3 
CURRENT FILING DATE: 2001-02-27 
PRIOR APPLICATION NUMBER: 09/416,901 
PRIOR FILING DATE: 1999-10-13 
PRIOR APPLICATION NUMBER: 60/155,493 
PRIOR FILING DATE: 1999-09-23 
PRIOR APPLICATION NUMBER: 09/404,133 
PRIOR FILING DATE: 1999-09-23 



; PRIOR APPLICATION NUMBER: PCT/US99/20881 

PRIOR FILING DATE: 1999-09-23 

PRIOR APPLICATION NUMBER: 60/101,594 

PRIOR FILING DATE: 1998-09-24 

NUMBER OF SEQ ID NOS : 7 3 
; SOFTWARE: Patentln Ver. 2.0 
; SEQ ID NO 63 
LENGTH: 10 
; TYPE: PRT 

; ORGANISM: Artificial Sequence 
FEATURE : 

; OTHER INFORMATION: Description of Artificial Sequence: synthetic 
US-09-794-743-63 

Query Match 100.0%; Score 8; DB 9; Length 10; 

Best Local Similarity 100.0%; Pred. No. 0.018; 

Matches 8; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 



Qy 1 EVNLDAEF 8 

I I I I I I i I 
Db 2 EVNLDAEF 9 



RESULT 18 
US-09-794-748-63 

Sequence 63, Application US/09794748 
Patent No. US20020037315A1 
GENERAL INFORMATION: 



AND 



APPLICANT 
APPLICANT 
APPLICANT 
APPLICANT 
APPLICANT 



Gurney, Mark E. 
Bienkowski, Michael J. 
Heinrikson, Robert L. 
Parodi, Luis A. 
Yan, Riqiang 



TITLE OF INVENTION: ALZHEIMER'S DISEASE SECRETASE, APP SUBSTRATES THEREFOR, 

TITLE OF INVENTION: USES 
TITLE OF INVENTION: THEREFOR 
FILE REFERENCE: 28341/6280JL 
CURRENT APPLICATION NUMBER: US/ 0 9/ 7 94 , 7 4 8 
CURRENT FILING DATE: 2001-02-27 
PRIOR APPLICATION NUMBER: 09/416,901 
PRIOR FILING DATE 
PRIOR APPLICATION 
PRIOR FILING DATE 
PRIOR APPLICATION 
PRIOR FILING DATE 
PRIOR APPLICATION 
PRIOR FILING DATE 
PRIOR APPLICATION 
PRIOR FILING DATE 
NUMBER OF SEQ ID 



1999-10-13 
NUMBER: 60/155, 493 

1999-09-23 
NUMBER: 09/404, 133 

1999-09-23 
NUMBER: PCT/US99/20 88 1 

1999-09-23 
NUMBER: 60/101, 594 
1998-09-24 
NOS: 7 3 

2 . 0 



SOFTWARE: Patentln Ver. 
SEQ ID NO 63 
LENGTH: 10 
TYPE: PRT 

ORGANISM: Artificial Sequence 
FEATURE : 



; OTHER INFORMATION: Description of Artificial Sequence: synthetic 
US-09-794-748-63 



Query Match 100.0%; Score 8; DB 9; Length 10; 

Best Local Similarity 100.0%; Pred. No. 0.018; 

Matches 8; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 

Qy 1 EVNLDAEF 8 

II I I I I I I 
Db 2 EVNLDAEF 9 



RESULT 19 
US-09-796-264-5 

Sequence 5, Application US/09796264 
Patent No. US2 002 0 04 9303A1 
GENERAL INFORMATION: 



APPLICANT 
APPLICANT 
APPLICANT 



PRIOR APPLICATION 
PRIOR FILING DATE 



Tang, Jordan J.N. 
Lin, Xinli 
Koelsch, Gerald 

TITLE OF INVENTION: Catalytically Active Recombinant Memapsin and Methods 
TITLE OF INVENTION: of Use Thereof 
FILE REFERENCE: OMRF 17 9 

CURRENT APPLICATION NUMBER: US/ 09/7 9 6 , 2 64 
CURRENT FILING DATE: 2001-02-28 
PRIOR APPLICATION NUMBER: 09/604,608 
PRIOR FILING DATE: 2000-06-27 

NUMBER: 6 0/168, 0 60 
1999-11-30 
PRIOR APPLICATION NUMBER: 60/177,836 
PRIOR FILING DATE: 2000-01-25 
PRIOR APPLICATION NUMBER: 60/178,368 
PRIOR FILING DATE: 2000-01-27 
PRIOR APPLICATION NUMBER: 60/210,292 
PRIOR FILING DATE: 2000-06-08 
NUMBER OF SEQ ID NOS : 31 
SOFTWARE: PatentlnVer. 2.1 
SEQ ID NO 5 
LENGTH: 10 
TYPE: PRT 

ORGANISM: Artificial Sequence 
FEATURE: 

OTHER INFORMATION: Description of Artificial Sequence: Synthetic 
OTHER INFORMATION: Peptide 
US-09-796-264-5 



Query Match 100.0%; Score 8; DB 9; Length 10; 

Best Local Similarity 100.0%; Pred. No. 0.018; 

Matches 8; Conservative 0; Mismatches 0; Indels 



0; Gaps 



0; 



Qy 



Db 



1 EVNLDAEF 8 

I I I I I I I I 

2 EVNLDAEF 9 



RESULT 2 0 
US-09-794-925-63 



Sequence 63, Application US/09794925 
Patent No. US 2 0020064819A1 
GENERAL INFORMATION: 
APPLICANT: Gurney, Mark E. 
APPLICANT: Bienkowski, Michael J. 
APPLICANT: Heinrikson, Robert L. 
APPLICANT: Parodi, Luis A. 
APPLICANT: Yan, Riqiang 

TITLE OF INVENTION: ALZHEIMER'S DISEASE SECRETASE, APP SUBSTRATES THEREFOR, 
AND USES 

TITLE OF INVENTION: THEREFOR 
FILE REFERENCE: 28341/6280HI 
CURRENT APPLICATION NUMBER: US / 0 9 / 7 94 , 92 5 
CURRENT FILING DATE: 2001-02-27 
PRIOR APPLICATION NUMBER: 09/416,901 
PRIOR FILING DATE: 1999-10-13 
PRIOR APPLICATION NUMBER: 60/155,493 
PRIOR FILING DATE: 1999-09-23 
PRIOR APPLICATION NUMBER: 09/404,133 
PRIOR FILING DATE: 1999-09-23 
PRIOR APPLICATION NUMBER: PCT/US99/2088 1 
PRIOR FILING DATE: 1999-09-23 
PRIOR APPLICATION NUMBER: 60/101,594 
PRIOR FILING DATE: 1998-09-24 
NUMBER OF SEQ ID NOS : 73 
SOFTWARE: Patentln Ver. 2.0 
SEQ ID NO 63 
LENGTH: 10 
TYPE: PRT 

ORGANISM: Artificial Sequence 
FEATURE : 

OTHER INFORMATION: Description of Artificial Sequence: synthetic 
US-09-794-925-63 

Query Match 100.0%; Score 8; DB 9; Length 10; 

Best Local Similarity 100.0%; Pred. No. 0.018; 

Matches 8; Conservative 0; Mismatches 0; Indels 0; Gaps 0 

Qy 1 EVNLDAEF 8 

I M I I I I I 
Db 2 EVNLDAEF 9 



RESULT 21 
US-09-681-442-63 

Sequence 63, Application US/09681442 
Patent No. US2002008 1634A1 
GENERAL INFORMATION: 
APPLICANT: Gurney, Mark E. 
APPLICANT: Bienkowski, Michael J. 
APPLICANT: Heinrikson, Robert L. 
APPLICANT: Parodi, Luis A. 
APPLICANT: Yan, Riqiang 

TITLE OF INVENTION: ALZHEIMER'S DISEASE SECRETASE, APP SUBSTRATES THEREFOR, 
AND USES 

TITLE OF INVENTION: THEREFOR 
FILE REFERENCE: 28341/6280FG 



; CURRENT APPLICATION NUMBER: US/09/ 68 1, 442 
; CURRENT FILING DATE: 2001-04-05 

PRIOR APPLICATION NUMBER: 09/416,901 

PRIOR FILING DATE: 1999-10-13 

PRIOR APPLICATION NUMBER: 60/155,493 

PRIOR FILING DATE: 1999-09-23 

PRIOR APPLICATION NUMBER: 09/404,133 
; PRIOR FILING DATE: 1999-09-23 

PRIOR APPLICATION NUMBER: PCT/US99/20881 

PRIOR FILING DATE: 1999-09-23 

PRIOR APPLICATION NUMBER: 60/101,594 

PRIOR FILING DATE: 1998-09-24 
; NUMBER OF SEQ ID NOS : 73 

SOFTWARE: Patentln Ver. 2.0 
; SEQ ID NO 63 
LENGTH: 10 
; TYPE: PRT 

; ORGANISM: Artificial Sequence 
FEATURE : 

; OTHER INFORMATION: Description of Artificial Sequence: synthetic 
US-09-681-442-63 

Query Match 100.0%; Score 8; DB 9; Length 10; 

Best Local Similarity 100.0%; Pred. No. 0.018; 

Matches 8; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 

Qy 1 EVNLDAEF 8 

M I I I I I I 
Db 2 EVNLDAEF 9 



RESULT 22 
US-09-845-226-5 

; Sequence 5, Application US/09845226 
; Patent No. US20020115600A1 
; GENERAL INFORMATION: 

APPLICANT: Tang, Jordan J.N. 

APPLICANT: Hong, Lin 

APPLICANT: Ghosh, Arun K. 
; TITLE OF INVENTION: Inhibitors of Memapsin 2 and Use Thereof 

FILE REFERENCE: OMRF 182 
; CURRENT APPLICATION NUMBER: US / 0 9 / 8 4 5 , 2 2 6 
; CURRENT FILING DATE: 2001-04-30 

PRIOR APPLICATION NUMBER: 09/603,713 

PRIOR FILING DATE: 2000-06-27 

PRIOR APPLICATION NUMBER: 60/168,060 
; PRIOR FILING DATE: 1999-11-30 

PRIOR APPLICATION NUMBER: 60/177,836 

PRIOR FILING DATE: 2000-01-25 
; PRIOR APPLICATION NUMBER: 60/178,368 
; PRIOR FILING DATE: 2000-01-27 
; PRIOR APPLICATION NUMBER: 60/210,292 

PRIOR FILING DATE: 2000-06-08 

NUMBER OF SEQ ID NOS: 31 

SOFTWARE: Patentln Ver. 2.1 
; SEQ ID NO 5 
LENGTH: 10 



; TYPE: PRT 

; ORGANISM: Artificial Sequence 
; FEATURE : 

; OTHER INFORMATION: Description of Artificial Sequence: Synthetic 

OTHER INFORMATION: Peptide 
US-09-845-226-5 

Query Match 100.0%; Score 8; DB 9; Length 10; 

Best Local Similarity 100.0%; Pred. No. 0.018; 

Matches 8; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 

Qy 1 EVNLDAEF 8 

I I I I I I I I 
Db 2 EVNLDAEF 9 



RESULT 23 
US-09-795-903A-5 

Sequence 5, Application US/09795903A 
Patent No. US2 002 01 64 7 60A1 
GENERAL INFORMATION: 



APPLICANT 
APPLICANT 
APPLICANT 



Tang, Jordan J.N. 
Lin, Xinli 
Koelsch, Gerald 

TITLE OF INVENTION: Catalytically Active Recombinant Memapsin and Methods 
TITLE OF INVENTION: of Use Thereof 
FILE REFERENCE: OMRF 17 9 

CURRENT APPLICATION NUMBER: US/ 09/ 7 95, 903A 
CURRENT FILING DATE: 2001-02-28 
PRIOR APPLICATION NUMBER: 09/604,608 
PRIOR FILING DATE: 2000-06-27 
PRIOR APPLICATION NUMBER: 60/168,060 
PRIOR FILING DATE: 1999-11-30 
PRIOR APPLICATION NUMBER: 60/177,836 
PRIOR FILING DATE: 2000-01-25 
PRIOR APPLICATION NUMBER: 60/178,368 
PRIOR FILING DATE: 2000-01-27 
PRIOR APPLICATION NUMBER: 60/210,292 
PRIOR FILING DATE: 2000-06-08 
NUMBER OF SEQ ID NOS : 31 
SOFTWARE: Patentln Ver. 2.1 
SEQ ID NO 5 
LENGTH: 10 
TYPE: PRT 

ORGANISM: Artificial Sequence 
FEATURE : 

OTHER INFORMATION: Description of Artificial Sequence: Synthetic 
OTHER INFORMATION: Peptide 
US-09-795-903A-5 

Query Match 100.0%; Score 8; DB 9; Length 10; 

Best Local Similarity 100.0%; Pred. No. 0.018; 

Matches 8; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 

Qy 1 EVNLDAEF 8 

MINIM 

Db 2 EVNLDAEF 9 



RESULT 2 4 
US-09-869-414-63 

; Sequence 63, Application US/09869414 

; Publication No. US2 0 03 007 722 6A1 

; GENERAL INFORMATION: 

; APPLICANT: Beinkowski et al . 

TITLE OF INVENTION: ALZHEIMER 1 S DISEASE SECRETASE, APP SUBSTRATES THEREFOR, 
AND USES 

; TITLE OF INVENTION: THEREFOR 

FILE REFERENCE: 28341/6280M 
; CURRENT APPLICATION NUMBER: US/ 0 9/8 69 , 4 14 

CURRENT FILING DATE: 2001-06-27 

PRIOR APPLICATION NUMBER: 09/416,901 

PRIOR FILING DATE: 1999-10-13 

PRIOR APPLICATION NUMBER: 60/155,493 

PRIOR FILING DATE: 1999-09-23 
; PRIOR APPLICATION NUMBER: 09/404,133 
; PRIOR FILING DATE: 1999-09-23 

PRIOR APPLICATION NUMBER: PCT/US99/2 08 81 

PRIOR FILING DATE: 1999-09-23 

PRIOR APPLICATION NUMBER: 60/101,594 

PRIOR FILING DATE: 1998-09-24 
; NUMBER OF SEQ ID NOS : 73 
; SOFTWARE: PatentlnVer. 2.0 
; SEQ ID NO 63 
; LENGTH: 10 
; TYPE: PRT 

; ORGANISM: Artificial Sequence 
; FEATURE : 

; OTHER INFORMATION: Description of Artificial Sequence: synthetic 
US-09-869-414-63 

Query Match 100.0%; Score 8; DB 10; Length 10; 

Best Local Similarity 100.0%; Pred. No. 0.018; 

Matches 8; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 

Qy 1 EVNLDAEF 8 

[ I I I I I I I 
Db 2 EVNLDAEF 9 



RESULT 2 5 
US-09-548-366-63 

Sequence 63, Application US/09548366 
Publication No. US2 00301 043 65A1 
GENERAL INFORMATION: 



APPLICANT 
APPLICANT 
APPLICANT 
APPLICANT 
APPLICANT 
TITLE OF 



Gurney, Mark E. 
Bienkowski, Michael J. 
Heinrikson, Robert L. 
Parodi , Luis A. 
Yan, Riqiang 

INVENTION: ALZHEIMER'S DISEASE SECRETASE, APP SUBSTRATES THEREFOR, 



AND 



; TITLE OF INVENTION: USES THEREFOR 
; FILE REFERENCE: 28341/6280A 



CURRENT APPLICATION NUMBER: US/ 09/548, 366 

CURRENT FILING DATE: 2000-04-12 

PRIOR APPLICATION NUMBER: 60/155,493 

PRIOR FILING DATE: 1999-09-23 

PRIOR APPLICATION NUMBER: 09/404,133 
; PRIOR FILING DATE: 1999-09-23 
; PRIOR APPLICATION NUMBER: PCT/US 9 9/2 0 8 8 1 
; PRIOR FILING DATE: 1999-09-23 
; PRIOR APPLICATION NUMBER: 60/101,594 
; PRIOR FILING DATE: 1998-09-24 
; NUMBER OF SEQ ID NOS : 65 

SOFTWARE: Patentln Ver. 2.0 
; SEQ ID NO 63 
LENGTH: 10 
TYPE : PRT 
; ORGANISM: Artificial Sequence 
FEATURE : 

; OTHER INFORMATION: Description of Artificial Sequence: synthetic 
US-09-548-366-63 

Query Match 100.0%; Score 8; DB 10; Length 10; 

Best Local Similarity 100.0%; Pred. No. 0.018; 

Matches 8; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 

Qy 1 EVNLDAEF 8 

I I I I 1 I I I 
Db 2 EVNLDAEF 9 



RESULT 2 6 
US-10-652-927-63 

; Sequence 63, Application US/10652927 

; Publication No. US2 0040043408A1 

; GENERAL INFORMATION: 

; APPLICANT: Gurney et al . 

; TITLE OF INVENTION: Alzheimer's Disease Secretase, APP Substrates Therefor 
and Uses 

; TITLE OF INVENTION: Therefor 
; FILE REFERENCE: 29915/6280N3 

; CURRENT APPLICATION NUMBER: US/10/652,927 

CURRENT FILING DATE: 2003-08-29 

PRIOR APPLICATION NUMBER: 09/794,925 

PRIOR FILING DATE: 2001-02-27 

PRIOR APPLICATION NUMBER: 09/416,901 

PRIOR FILING DATE: 1999-10-13 

PRIOR APPLICATION NUMBER: 60/155,493 
; PRIOR FILING DATE: 1999-09-23 
; PRIOR APPLICATION NUMBER: 09/404,133 

PRIOR FILING DATE: 1999-09-23 

PRIOR APPLICATION NUMBER: PCT/US 9 9 /2 0 8 8 1 

PRIOR FILING DATE: 1999-09-23 

PRIOR APPLICATION NUMBER: 60/101,594 

PRIOR FILING DATE: 1998-09-24 

NUMBER OF SEQ ID NOS: 74 
; SOFTWARE: Patentln Ver. 2.0 
; SEQ ID NO 63 
LENGTH: 10 



TYPE: PRT 
; ORGANISM: Artificial sequence 
FEATURE : 

OTHER INFORMATION: Synthetic peptide 
US-10-652-927-63 

Query Match 100.0%; Score 8; DB 12; Length 10; 

Best Local Similarity 100.0%; Pred. No. 0.018; 

Matches 8; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 

Qy 1 EVNLDAEF 8 

I I I i I I I I 
Db 2 EVNLDAEF 9 



RESULT 2 7 
US-10-652-830-63 

; Sequence 63, Application US/10652830 

; Publication No. US 2 0 04 0 04 8303A1 

; GENERAL INFORMATION: 

; APPLICANT: Gurney et al . 

; TITLE OF INVENTION: Alzheimer's Disease Secretase, APP Substrates Therefor 
and Uses 

; TITLE OF INVENTION: Therefor 
FILE REFERENCE: 29915/6280N1 

CURRENT APPLICATION NUMBER: US/10/652 , 830 

CURRENT FILING DATE: 2003-08-29 

PRIOR APPLICATION NUMBER: 09/794,925 

PRIOR FILING DATE: 2001-02-27 

PRIOR APPLICATION NUMBER: 09/416,901 

PRIOR FILING DATE: 1999-10-13 
; PRIOR APPLICATION NUMBER: 60/155,493 

PRIOR FILING DATE: 1999-09-23 

PRIOR APPLICATION NUMBER: 09/404,133 

PRIOR FILING DATE: 1999-09-23 

PRIOR APPLICATION NUMBER: PCT/US 9 9/ 2 0 8 8 1 

PRIOR FILING DATE: 1999-09-23 

PRIOR APPLICATION NUMBER: 60/101,594 
; PRIOR FILING DATE: 1998-09-24 

NUMBER OF SEQ ID NOS : 74 

SOFTWARE: Patentln Ver. 2.0 
; SEQ ID NO 63 
LENGTH: 10 
; TYPE: PRT 

; ORGANISM: Artificial sequence 
FEATURE : 

OTHER INFORMATION: Synthetic peptide 
US-10-652-830-63 

Query Match 100.0%; Score 8; DB 12; Length 10; 

Best Local Similarity 100.0%; Pred. No. 0.018; 

Matches 8; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 

Qy 1 EVNLDAEF 8 

I I I I I i I ! 
Db 2 EVNLDAEF 9 



RESULT 2 8 
US-10-032-818-8 

; Sequence 8, Application US/10032818 
; Publication No. US2 0 03 0092 62 9A1 
; GENERAL INFORMATION: 

APPLICANT: Tang, Jordan J.N. 
; APPLICANT: Koelsch, Gerald 

APPLICANT: Ghosh, Arun K . 
; TITLE OF INVENTION: Inhibitors of Memapsin 2 and Use Thereof 
; FILE REFERENCE: 2932.1006-007 
; CURRENT APPLICATION NUMBER: US/ 1 0/ 0 32 , 8 1 8 
; CURRENT FILING DATE: 2001-12-28 

PRIOR APPLICATION NUMBER: US 60/275,756 

PRIOR FILING DATE: 2001-03-14 

PRIOR APPLICATION NUMBER: US 60/258,705 

PRIOR FILING DATE: 2000-12-28 
; NUMBER OF SEQ ID NOS : 83 

; SOFTWARE: FastSEQ for Windows Version 4.0 
; SEQ ID NO 8 

LENGTH: 10 
; TYPE: PRT 

; ORGANISM: Artificial Sequence 
FEATURE : 

; OTHER INFORMATION: Synthetic peptide 
US-10-032-818-8 

Query Match 100.0%; Score 8; DB 14; Length 10; 

Best Local Similarity 100.0%; Pred. No. 0.018; 

Matches 8; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 

Qy 1 EVNLDAEF 8 

MINIM 
Db 2 EVNLDAEF 9 



RESULT 2 9 
US-10-050-200-22 

; Sequence 22, Application US/10050200 
; Publication No. US2 0 03 01 66037A1 
; GENERAL INFORMATION: 

APPLICANT : Fourie, Anne 
; APPLICANT: Coles, Fawn 

APPLICANT: Karlsson, Lars 
; TITLE OF INVENTION: Aggrecanase-1 and -2 Peptide Substrates and Methods 

FILE REFERENCE: ORT-1417 
; CURRENT APPLICATION NUMBER: US/ 1 0/ 0 5 0 , 2 0 0 
; CURRENT FILING DATE: 2002-01-16 

NUMBER OF SEQ ID NOS: 60 

SOFTWARE: Patentln version 3.1 
; SEQ ID NO 22 
; LENGTH: 10 
; TYPE: PRT 

; ORGANISM: Artificial sequence 
FEATURE : 

OTHER INFORMATION: peptide substrate 
US-10-050-200-22 



Query Match 100.0%; Score 8; DB 14; Length 10; 

Best Local Similarity 100.0%; Pred. No. 0.018; 

Matches 8; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 

Qy 1 EVNLDAEF 8 

I 1 I I I I I I 
Db 3 EVNLDAEF 10 



RESULT 3 0 
US-10-354-955-2 

; Sequence 2, Application US/10354955 
; Publication No. US20030171291A1 
; GENERAL INFORMATION: 

APPLICANT: Gary Christie 

APPLICANT: Ishrut Hussain 
; APPLICANT: David J. Powell 

; TITLE OF INVENTION: No. US2 0 03017 12 9 lAlel Treatment 

FILE REFERENCE: P32448 
; CURRENT APPLICATION NUMBER: US/10/354,955 

CURRENT FILING DATE: 2 003-01-30 

PRIOR APPLICATION NUMBER: 09/693,744 

PRIOR FILING DATE: 2000-10-20 

PRIOR APPLICATION NUMBER: 9925136.5 

PRIOR FILING DATE: 1999-10-22 

NUMBER OF SEQ ID NOS : 9 
; SOFTWARE: FastSEQ for Windows Version 3.0 
; SEQ ID NO 2 
LENGTH: 11 
TYPE: PRT 
; ORGANISM: human 
US-10-354-955-2 

Query Match 100.0%; Score 8; DB 14; Length 11; 

Best Local Similarity 100.0%; Pred. No. 0.019; 

Matches 8; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 

Qy 1 EVNLDAEF 8 

MINIM 

Db 3 EVNLDAEF 10 



RESULT 31 
US-10-354-955-4 

; Sequence 4, Application US/10354955 

; Publication No. US200301712 91A1 

; GENERAL INFORMATION: 

; APPLICANT: Gary Christie 

; APPLICANT: Ishrut Hussain 

APPLICANT: David J. Powell 
; TITLE OF INVENTION: No. US2 003017 12 9 lAlel Treatment 
; FILE REFERENCE: P32448 

CURRENT APPLICATION NUMBER: US/ 10/ 354 , 955 

CURRENT FILING DATE: 2003-01-30 

PRIOR APPLICATION NUMBER: 09/693,744 

PRIOR FILING DATE: 2000-10-20 



PRIOR APPLICATION NUMBER: 9925136.5 
PRIOR FILING DATE: 1999-10-22 
NUMBER OF SEQ ID NOS : 9 
; SOFTWARE: FastSEQ for Windows Version 3.0 
; SEQ ID NO 4 
LENGTH: 11 
TYPE: PRT 
; ORGANISM: Artificial Sequence 
; FEATURE: 

; OTHER INFORMATION : fusion protein with maltose binding protein attached at 

residue 1 ' 

US-10-354-955-4 

Query Match 100.0%; Score 8; DB 14; Length 11; 

Best Local Similarity 100.0%; Pred. No. 0.019; 

Matches 8; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 

Qy 1 EVNLDAEF 8 

I I I I I I I I 
Db 3 EVNLDAEF 10 



RESULT 32 
US-09-896-874-1 

; Sequence 1, Application US/09896874 
; Patent No. US2 002 001 632 0A1 
; GENERAL INFORMATION: 
; APPLICANT: Fang, Lawrence Y. 
APPLICANT: John, Varghese 

TITLE OF INVENTION: COMPOUNDS TO TREAT ALZHEIMER'S DISEASE 

FILE REFERENCE: 13615. 40USU1 
; CURRENT APPLICATION NUMBER: US/ 09/896, 874 
; CURRENT FILING DATE: 2001-06-29 
; PRIOR APPLICATION NUMBER: US 60/215,323 
; PRIOR FILING DATE: 2000-06-30 

NUMBER OF SEQ ID NOS: 9 
; SOFTWARE: Patentln version 3.1 
; SEQ ID NO 1 
; LENGTH: 12 

TYPE: PRT 
; ORGANISM: Artificial Sequence 
FEATURE: 

; OTHER INFORMATION: Synthetic 
US-09-896-874-1 

Query Match 100.0%; Score 8; DB 9; Length 12; 

Best Local Similarity 100.0%; Pred. No. 0.021; 

Matches 8; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 

Qy 1 EVNLDAEF 8 

ill lilt 
Db 2 EVNLDAEF 9 



RESULT 33 
US-09-896-139-1 

; Sequence 1, Application US/09896139 



Patent No. US2002 012 8255A1 
GENERAL INFORMATION: 



APPLICANT 
APPLICANT 
APPLICANT 
APPLICANT 
APPLICANT 
APPLICANT 
APPLICANT 
APPLICANT 
APPLICANT 
APPLICANT 



Beck, James P. 
Fang, Lawrence Y. 
Freskos, John N. 
Gailunas, Andrea 
Horn, Roy 

Jagodi zinska , Barbara 
John, Varghese 
Maillaird, Michel 
Pulley, Shon R. 
TenBrink, Ruth E. 
TITLE OF INVENTION: COMPOUNDS TO TREAT ALZHEIMER'S DISEASE 
FILE REFERENCE: 13615. 25USU4 

CURRENT APPLICATION NUMBER: US/ 09/8 96, 13 9 
CURRENT FILING DATE: 2001-06-29 
PRIOR APPLICATION NUMBER: US 60/215,323 
PRIOR FILING DATE: 2000-06-30 
PRIOR APPLICATION NUMBER: US 60/252,736 
PRIOR FILING DATE: 2000-11-22 
PRIOR APPLICATION NUMBER: US 60/255,956 
PRIOR FILING DATE: 2000-12-15 
PRIOR APPLICATION NUMBER: US 60/268,497 
PRIOR FILING DATE: 2001-02-13 
PRIOR APPLICATION NUMBER: US 60/279,779 
PRIOR FILING DATE: 2001-03-29 
PRIOR APPLICATION NUMBER: US 60/295,589 
PRIOR FILING DATE: 2001-06-04 
NUMBER OF SEQ ID NOS : 9 
SOFTWARE: Patentln version 3.1 
SEQ ID NO 1 
LENGTH: 12 
TYPE: PRT 

ORGANISM: Artificial Sequence 
FEATURE : 

OTHER INFORMATION: Synthetic 
US-09-896-139-1 

Query Match 100.0%; Score 8; DB 9; Length 12; 

Best Local Similarity 100.0%; Pred. No. 0.021; 

Matches 8; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 

Qy 1 EVNLDAEF 8 

MINIM 
Db 2 EVNLDAEF 9 



RESULT 34 
US-09-895-843-1 

Sequence 1, Application US/09895843 
Patent No. US2 0 02 0143 177A1 
GENERAL INFORMATION: 



APPLICANT 
APPLICANT 
APPLICANT 
APPLICANT 
APPLICANT 



Beck, James P. 
Fang, Lawrence Y. 
Freskos, John N. 
Gailunas, Andrea 
Horn, Roy 



APPLICANT 
APPLICANT 
APPLICANT 
APPLICANT 
APPLICANT 
TITLE OF 



Jagodizinska, Barbara 
John, Varghese 
Maillaird, Michel 
Pulley, Shon R. 
TenBrink, Ruth E. 
INVENTION: COMPOUNDS TO TREAT ALZHEIMER'S DISEASE 



FILE REFERENCE: 13615. 41USU1 

CURRENT APPLICATION NUMBER: US/ 09/ 8 95 , 8 4 3 
CURRENT FILING DATE: 2001-06-29 
PRIOR APPLICATION NUMBER: US 60/215,323 
PRIOR FILING DATE: 2000-06-30 
NUMBER OF SEQ ID NOS : 9 
SOFTWARE: Patentln version 3.1 
SEQ ID NO 1 
LENGTH: 12 
TYPE: PRT 

ORGANISM: Artificial Sequence 
FEATURE : 

OTHER INFORMATION: Synthetic 
US-09-895-843-1 



Query Match 100.0%; Score 8; DB 9; Length 12; 

Best Local Similarity 100.0%; Pred. No. 0.021; 

Matches 8; Conservative 0; Mismatches 0; Indels 



0; Gaps 



0; 



Qy 

Db 



1 EVNLDAEF 8 

I I I I I I I I 

2 EVNLDAEF 9 



RESULT 35 
US-09-895-871-1 

; Sequence 1, Application US/09895871 

; Publication No. US2 0 030 0968 64A1 

; GENERAL INFORMATION: 

; APPLICANT: Fang, Lawrence Y. 

; APPLICANT: Horn, Roy 

; APPLICANT: John, Varghese 

APPLICANT: Maillaird, Michel 

TITLE OF INVENTION: COMPOUNDS TO TREAT ALZHEIMER'S DISEASE 
FILE REFERENCE: 13615. 21USU1 

CURRENT APPLICATION NUMBER: US / 0 9 / 8 95 , 8 7 1 
; CURRENT FILING DATE: 2001-06-29 
; PRIOR APPLICATION NUMBER: US 60/215,323 
; PRIOR FILING DATE: 2000-06-30 
; NUMBER OF SEQ ID NOS: 9 
; SOFTWARE: Patentln version 3.1 
; SEQ ID NO 1 
LENGTH: 12 
TYPE: PRT 

ORGANISM: Artificial Sequence 
FEATURE : 

OTHER INFORMATION: Synthetic 
US-09-895-871-1 



Query Match 100.0%; 
Best Local Similarity 100.0%; 



Score 8; DB 10; 
Pred. No. 0.021; 



Length 12; 



Matches 



8; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 



Qy 1 EVNLDAEF 8 

I I I I I ! I i 
Db 2 EVNLDAEF 9 



RESULT 3 6 
US-10-032-818-26 

; Sequence 26, Application US/10032818 
; Publication No. US2 00300 92 62 9A1 
; GENERAL INFORMATION: 

APPLICANT: Tang, Jordan J.N. 

APPLICANT: Koelsch, Gerald 

APPLICANT: Ghosh, Arun K. 
; TITLE OF INVENTION: Inhibitors of Memapsin 2 and Use Thereof 
; FILE REFERENCE: 2 932.1006-007 

CURRENT APPLICATION NUMBER: US/ 1 0/ 0 32 , 8 1 8 

CURRENT FILING DATE: 2001-12-28 
; PRIOR APPLICATION NUMBER: US 60/275,756 

PRIOR FILING DATE: 2001-03-14 

PRIOR APPLICATION NUMBER: US 60/258,705 

PRIOR FILING DATE: 2000-12-28 

NUMBER OF SEQ ID NOS : 8 3 
; SOFTWARE: FastSEQ for Windows Version 4.0 
; SEQ ID NO 26 
; LENGTH: 12 
; TYPE: PRT 

; ORGANISM: Artificial Sequence 
; FEATURE : 

; OTHER INFORMATION: Synthetic peptide 
US-10-032-818-26 

Query Match 100.0%; Score 8; DB 14; Length 12; 

Best Local Similarity 100.0%; Pred. No. 0.021; 

Matches 8; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 

Qy 1 EVNLDAEF 8 

I 1 f I I M I 
Db 1 EVNLDAEF 8 



RESULT 3 7 
US-10-400-273-3 

Sequence 3, Application US/10400273 
Publication No. US20040014 194A1 
GENERAL INFORMATION: 



APPLICANT 
APPLICANT 
APPLICANT 
APPLICANT 
APPLICANT 
APPLICANT 
APPLICANT 
APPLICANT 



Beyer, Brian 
Hammond, Gerald S 
Reichert, Paul 
Strickland, Corey 
Wang, Wenyan 
Weber, Patricia C 
Wong, Gwendolyn 



Zhang, Lili 

TITLE OF INVENTION: BETA- SEC RETASE CRYSTALS AND METHODS FOR PREPARING AND 
USING THE SAME 



FILE REFERENCE: JB01531-K-US 

CURRENT APPLICATION NUMBER: US/10/400, 273 

CURRENT FILING DATE: 2003-03-26 

PRIOR APPLICATION NUMBER: 60/367,937 

PRIOR FILING DATE: 2002-03-27 

NUMBER OF SEQ ID NOS : 5 
; SOFTWARE: Patentln version 3.1 
; SEQ ID NO 3 
LENGTH: 12 
TYPE: PRT 
; ORGANISM: Artificial Sequence 
; FEATURE : 

; OTHER INFORMATION: cleavage sequence 
US-10-400-273-3 

Query Match 100.0%; Score 8; DB 15; Length 12; 

Best Local Similarity 100.0%; Pred. No. 0.021; 

Matches 8; Conservative 0; Mismatches 0; Indels 

Qy 1 EVNLDAEF 8 

I M I I II 1 

Db 3 EVNLDAEF 10 



RESULT 3 8 
US-10-299-739-1 

Sequence 1, Application US/10299739 
Publication No. US20040039064A1 
GENERAL INFORMATION: 



APPLICANT 
APPLICANT 
APPLICANT 
APPLICANT 
APPLICANT 



Elan Pharmaceuticals , Inc . 
Pharmacia & Up j ohn Company 
Romero, Arthur G. 
Schostarez , He in rich J . 
Christina, McGrain M. 
TITLE OF INVENTION: Amine 1,2- and 1,3-Diol Compounds 
FILE REFERENCE: 01-1724-B 

CURRENT APPLICATION NUMBER: US/ 1 0/ 2 99, 7 39 
CURRENT FILING DATE: 2003-11-19 
PRIOR APPLICATION NUMBER: 60/333,081 
PRIOR FILING DATE: 2001-11-19 
PRIOR APPLICATION NUMBER: 60/334,000 
PRIOR FILING DATE: 2001-11-18 
PRIOR APPLICATION NUMBER: 60/362,752 
PRIOR FILING DATE: 2002-03-08 
NUMBER OF SEQ ID NOS: 9 
SOFTWARE: Patentln version 3.2 
SEQ ID NO 1 

LENGTH: 13 

TYPE: PRT 

ORGANISM : Artificial Sequence 
FEATURE : 

OTHER INFORMATION: synthetic peptide 
FEATURE : 

NAME/ KEY : MI SC_FEATURE 
LOCATION: (1) . . (1) 

OTHER INFORMATION: N-terminal biotin 
FEATURE: 



NAME /KEY: MISC_FEATURE 

LOCATION: (11).. (11) 
; OTHER INFORMATION: covalent attachment of Oregon green 
US-10-299-739-1 

Query Match 100.0%; Score 8; DB 12; Length 13; 

Best Local Similarity 100.0%; Pred. No. 0.022; 

Matches 8; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 

Qy 1 EVNLDAEF 8 

I I I I I I I I 
Db 2 EVNLDAEF 9 



RESULT 3 9 
US-10-192-625-1 

; Sequence 1, Application US/10192625 
; Publication No. US2 0 0300 83353A1 
; GENERAL INFORMATION: 

; APPLICANT: Schostarez, Heinrich J. 
; APPLICANT: Chrusciel, Robert A 

; TITLE OF INVENTION: Diaminediols for the Treatment of Alzheimer's Disease 
FILE REFERENCE: 01-1645-B 

CURRENT APPLICATION NUMBER: US/10/192, 625 

CURRENT FILING DATE: 2002-07-10 

PRIOR APPLICATION NUMBER: 60/304,305 

PRIOR FILING DATE: 2001-07-10 

PRIOR APPLICATION NUMBER: 60/334,480 

PRIOR FILING DATE: 2001-07-10 

NUMBER OF SEQ ID NOS : 9 
; SOFTWARE: Patentln version 3.1 
; SEQ ID NO 1 
; LENGTH: 13 
; TYPE: PRT 

; ORGANISM: Artificial Sequence 
FEATURE : 

; OTHER INFORMATION: synthetic peptide 
FEATURE : 

NAME /KEY: MISC_FEATURE 
; LOCATION: (1) . . (1) 

OTHER INFORMATION: N-terminal biotin 
; FEATURE : 

NAME/ KEY: MISC_FEATURE 

LOCATION: (11).. (11) 
; OTHER INFORMATION: covalent attachment of Oregon green 
US-10-192-625-1 

Query Match 100.0%; Score 8; DB 14; Length 13; 

Best Local Similarity 100.0%; Pred. No. 0.022; 

Matches 8; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 

Qy 1 EVNLDAEF 8 

MINIM 

Db 2 EVNLDAEF 9 



RESULT 4 0 



US-10-192-424-1 

; Sequence 1, Application US/10192424 
; Publication No. US2 0030083356A1 
; GENERAL INFORMATION: 

; APPLICANT: Schostarez, Heinrich J. 
; APPLICANT: Chrusciel, Robert A. 

; TITLE OF INVENTION: Statine Derivatives for the Treatment of Alzheimer's 
Disease 

FILE REFERENCE: 02-154-A 

CURRENT APPLICATION NUMBER: US/ 1 0 / 1 92 , 4 2 4 
; CURRENT FILING DATE: 2002-07-10 

PRIOR APPLICATION NUMBER: 60/204,128 

PRIOR FILING DATE: 2001-07-10 

PRIOR APPLICATION NUMBER: 60/327,424 

PRIOR FILING DATE: 2001-10-05 

NUMBER OF SEQ ID NOS : 9 
; SOFTWARE: Patentln version 3.1 
; SEQ ID NO 1 

LENGTH: 13 
; TYPE: PRT 

; ORGANISM: Artificial Sequence 
FEATURE : 

; OTHER INFORMATION: synthetic peptide 
; FEATURE : 

; NAME/KEY: MISC_FEATURE 

LOCATION: (1) . . (1) 
; OTHER INFORMATION: N-terminal biotin 

FEATURE : 

NAME /KEY : MISC_FEATURE 

LOCATION: (11).. (11) 
; OTHER INFORMATION: covalent attachment of oregon green 
US-10-192-424-1 

Query Match 100.0%; Score 8; DB 14; Length 13; 

Best Local Similarity 100.0%; Pred. No. 0.022; 

Matches 8; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 

Qy 1 EVNLDAEF 8 

I I I I I I I I 
Db 2 EVNLDAEF 9 

Search completed: March 26, 2004, 15:44:15 
Job time : 47.5 sees 



GenCore version 5.1.6 
Copyright (c) 1993 - 2004 Compugen Ltd. 



OM protein - protein search, using sw model 



Run on 



March 26, 2004, 15:30:32 ; Search time 36 Seconds 

(without alignments) 
70.115 Million cell updates/sec 



Title: US- 0 9-668-3 14C-7 1 

Perfect score: 8 

Sequence: 1 EVNLDAEF 8 

Scoring table: OLIGO 

Gapop 60.0 , Gapext 60.0 



Searched: 



1017041 seqs, 315518202 residues 



Word size : 



0 



Total number of hits satisfying chosen parameters: 



1017041 



Minimum DB seq length: 0 

Maximum DB seq length: 2000000000 

Post-processing: Listing first 1000 summaries 



Database 



SPTREMBL 25:* 



1 

2 
3 
4 
5 
6 
7 
8 
9 
10 
11 
12 
13 
14 
15 
16 
17 



sp_archea : * 
sp_bacteria : * 
sp_f ungi : * 
sp_human : * 
sp_invertebrate : * 
s p_mamma 1 : * 
sp__mhc : * 
sp_organelle : * 
sp_phage : * 

sp_plant : * 

sp_rodent : * 

sp_virus : * 

sp_vertebrate : * 

sp_unclassif ied: * 

sp__rvirus : * 

sp_bacteriap : * 

sp archeap : * 



Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 



SUMMARIES 



Result 
No . 



Score 



% 

Query 

Match Length 



DB ID 



Description 



1 


6 


75. 


0 


54 


16 


Q8P193 


Q8pl93 streptococc 


2 


6 


75. 


0 


91 


16 


Q89LY8 


Q8 91y8 bradyrhizob 


3 


6 


75. 


0 


101 


12 


Q80PA8 


Q80pa8 spodoptera 


4 


6 


75. 


0 


173 


16 


Q87ZB6 


Q87zb6 pseudomonas 


5 


6 


75. 


0 


221 


16 


Q82WI7 


Q82wi7 nitrosomona 


6 


6 


75 . 


0 


281 


16 


Q8A4M0 


Q8a4m0 bacteroides 


7 


6 


75. 


0 


299 


2 


Q93ET9 


Q93et9 escherichia 


8 


6 


75. 


0 


299 


2 


Q9EVD7 


Q9evd7 enterococcu 


9 


6 


75. 


0 


304 


2 


Q933C0 


Q933c0 rhizobium t 


10 


6 


75. 


0 


308 


10 


Q8H0B0 


Q8h0b0 oryza sativ 


11 


6 


75 . 


0 


319 


16 


Q7V5K6 


Q7v5k6 prochloroco 


12 


6 


75 . 


0 


324 


16 


Q88P95 


Q88p95 pseudomonas 


13 


6 


75. 


0 


324 


16 


Q7U4X9 


Q7u4x9 synechococc 


14 


6 


75. 


0 


326 


16 


Q9HVW0 


Q9hvw0 pseudomonas 


15 


6 


75. 


0 


352 


2 


Q9AHQ9 


Q9ahq9 gamma-prote 


16 


6 


75. 


0 


357 


10 


Q8H0A8 


Q8h0a8 oryza sativ 


17 


6 


75 . 


0 


381 


10 


Q8LNJ6 


Q81nj6 oryza sativ 


18 


6 


75. 


0 


436 


13 


Q 8 AVI 2 


Q8avi2 xenopus lae 


19 


6 


75. 


0 


500 


16 


Q87UP7 


Q87up7 pseudomonas 


20 


6 


75. 


0 


802 


17 


Q97XQ7 


Q97xq7 sulfolobus 


21 


6 


75 . 


0 


822 


5 


Q9VI8 2 


Q9vi82 drosophila 


22 


6 


75 . 


0 


839 


5 


Q961M8 


Q961m8 drosophila 


23 


6 


75 . 


0 


868 


16 


Q9KRY9 


Q9kry9 vibrio chol 


24 


6 


75 . 


0 


894 


16 


Q7WMP8 


Q7wmp8 bordetella 


25 


6 


75 . 


0 


894 


16 


Q7WB80 


Q7wb80 bordetella 


26 


6 


75 . 


0 


1324 


16 


Q8DL57 


Q8dl57 synechococc 


27 


6 


75 . 


0 


1469 


5 


Q8INT5 


Q8int5 drosophila 


28 


6 


75 . 


0 


4639 


5 


Q9VZ8 3 


Q9vz83 drosophila 


29 


6 


75 . 


0 


4998 


11 


Q8CG65 


Q8cg65 mus musculu 


30 


5 


62 . 


5 


46 


16 


Q8 8UM4 


Q88um4 lactobacill 


31 


5 


62 . 


5 


49 


13 


Q9PVI4 


Q9pvi4 gallus gall 


32 


5 


62 . 


5 


50 


2 


Q8KM85 


Q8km85 mycoplasma 


33 


5 


62 . 


5 


55 


16 


Q8CKW2 


Q8ckw2 yersinia pe 


34 


5 


62 . 


5 


63 


4 


Q8WYE8 


Q8wye8 homo sapien 


35 


5 


62 . 


5 


66 


3 


Q96X11 


Q96xll phaeosphaer 


36 


5 


62 . 


5 


67 


6 


Q9TU51 


Q9tu51 sus scrofa 


37 


5 


62 . 


5 


67 


13 


Q9IB55 


Q9ib55 xenopus lae 


38 


5 


62 . 


5 


67 


13 


Q9IB58 


Q9ib58 xenopus lae 


39 


5 


62 . 


5 


68 


16 


Q8CN01 


Q8cn01 staphylococ 


40 


5 


62 . 


5 


70 


1 


Q8NKU7 


Q8nku7 acidianus a 


41 


5 


62 . 


5 


79 


2 


P96055 


P96055 salmonella 


42 


5 


62 . 


5 


84 


2 


Q8KM09 


Q8km09 photorhabdu 


43 


5 


62 . 


5 


84 


13 


Q9W7I7 


Q9w7i7 gallus gall 


44 


5 


62 . 


5 


88 


16 


Q7UR07 


Q7ur07 rhodopirell 


45 


5 


62 . 


5 


89 


5 


Q7YZN2 


Q7yzn2 trichomonas 


46 


5 


62 . 


5 


91 


16 


Q8F784 


Q8f784 leptospira 


47 


5 


62 . 


5 


92 


5 


Q8IS88 


Q8is88 trypanosoma 


48 


5 


62 . 


5 


92 


5 


Q7YZC6 


Q7yzc6 trypanosoma 


49 


5 


62 . 


5 


92 


5 


Q7YZC5 


Q7yzc5 trypanosoma 


50 


5 


62 . 


5 


98 


6 


Q9MYU9 


Q9myu9 canis famil 


51 


5 


62 . 


5 


105 


3 


013706 


013706 schizosacch 


52 


5 


62 . 


5 


105 


16 


Q8ZQG4 


Q8zqg4 salmonella 


53 


5 


62 . 


5 


109 


16 


Q8NT07 


Q8nt07 corynebacte 


54 


5 


62 . 


5 


109 


16 


Q8ZQ87 


Q8zq87 salmonella 


55 


5 


62 . 


5 


110 


3 


Q12338 


Q12338 saccharomyc 


56 


5 


62 . 


5 


110 


16 


Q81L62 


Q81162 bacillus an 


57 


5 


62. 


5 


110 


16 


Q817L0 


Q81710 bacillus ce 



58 


5 


62 . 


5 


110 


17 


Q8PYI4 


Q8pyi4 methanosarc 


59 


5 


62 . 


5 


111 


2 


Q9RIH2 


Q9rih2 xenorhabdus 


60 


5 


62 . 


5 


111 


3 


Q86ZQ6 


Q8 6zq6 heterobasid 


61 


5 


62 . 


5 


111 


3 


Q86ZQ5 


Q8 6zq5 heterobasid 


62 


5 


62 . 


5 


111 


3 


Q86ZQ4 


Q8 6zq4 heterobasid 


63 


5 


62 . 


5 


111 


3 


Q86Z67 


Q8 6z67 heterobasid 


64 


5 


62 . 


5 


111 


5 


Q8I176 


Q8il76 drosophila 


65 


5 


62 . 


5 


112 


16 


Q8ZN06 


Q8zn06 salmonella 


66 


5 


62 . 


5 


117 


11 


Q8BL85 


Q8bl85 mus musculu 


67 


5 


62 . 


5 


119 


5 


Q9N4P2 


Q9n4p2 caenorhabdi 


68 


5 


62 . 


5 


123 


10 


Q852G9 


Q852g9 oryza sativ 


69 


5 


62 . 


5 


126 


16 


Q8Y3M6 


Q8y3m6 listeria mo 


70 


5 


62 . 


5 


129 


2 


Q9S3I8 


Q9s3i8 haemophilias 


71 


5 


62 . 


5 


129 


2 


032626 


032 62 6 haemophilus 


72 


5 


62 . 


5 


130 


2 


Q9EWB7 


Q9ewb7 s t reptomyce 


73 


5 


62 . 


5 


132 


4 


Q9HBS8 


Q9hbs 8 homo sapien 


74 


5 


62 . 


5 


132 


16 


Q8PG55 


Q8pg55 xanthomonas 


75 


5 


62 . 


5 


133 


16 


Q81JK5 


Q81jk5 bacillus an 


76 


5 


62 . 


5 


135 


17 


028910 


028910 archaeoglob 


77 


5 


62 . 


5 


136 


17 


Q9YBS2 


Q9ybs2 aeropyrum p 


78 


5 


62 . 


5 


141 


16 


Q9ANL3 


Q9anl3 bradyrhizob 


79 


5 


62 . 


5 


142 


2 


Q9F0I4 


Q9 f 0i4 rhizobium f 


80 


5 


62 . 


5 


142 


16 


P73462 


P73462 synechocyst 


81 


5 


62 . 


5 


143 


4 


Q9H935 


Q9h935 homo sapien 


82 


5 


62 . 


5 


144 


16 


Q9HVI0 


Q9hvi0 pseudomonas 


83 


5 


62 . 


5 


146 


2 


Q7WX31 


Q7wx31 alcali genes 


84 


5 


62 . 


5 


147 


16 


P74066 


P74066 synechocyst 


85 


5 


62. 


5 


147 


17 


Q8TTG1 


Q8ttgl methanosarc 


86 


5 


62 . 


5 


148 


11 


Q9CXA5 


Q9cxa5 mus musculu 


87 


5 


62 . 


5 


148 


16 


Q8Y906 


Q8y906 listeria mo 


88 


5 


62 . 


5 


151 


16 


Q8ZBQ4 


Q8zbq4 yersinia pe 


89 


5 


62 . 


5 


156 


2 


Q8KY19 


Q8 kyl 9 pseudomonas 


90 


5 


62 . 


5 


156 


2 


Q8KRE2 


Q8 kre2 pseudomonas 


91 


5 


62 . 


5 


156 


16 


Q88EB4 


Q8 8eb4 pseudomonas 


92 


5 


62 . 


5 


159 


2 


Q9F8K6 


Q9f 8k6 carboxydoth 


93 


5 


62. 


5 


166 


16 


069720 


069720 mycobacteri 


94 


5 


62 . 


5 


168 


13 


Q7SYI3 


Q7syi3 gallus gall 


95 


5 


62 . 


5 


168 


16 


Q9PQB0 


Q9pqb0 ureaplasma 


96 


5 


62 . 


5 


170 


16 


Q8KDH8 


Q8kdh8 chlorobium 


97 


5 


62 . 


5 


170 


17 


052035 


052035 halobacteri 


98 


5 


62 . 


5 


171 


3 


Q9HFT2 


Q9hf t2 coriolopsis 


99 


5 


62 . 


5 


172 


16 


Q9PMA5 


Q9pma5 campylobact 


100 


5 


62 . 


5 


172 


16 


Q8CVT1 


Q8cvtl escherichia 


101 


5 


62 . 


5 


172 


16 


Q83K10 


Q83kl0 shigella fl 


102 


5 


62 . 


5 


173 


3 


Q12408 


Q124 08 saccharomyc 


103 


5 


62 . 


5 


173 


16 


Q7VFT1 


Q7vftl helicobacte 


104 


5 


62 . 


5 


173 


16 


Q7TVS0 


Q7tvs0 mycobacteri 


105 


5 


62 . 


5 


174 


2 


Q9S696 


Q9s69 6 haemophilus 


106 


5 


62 . 


5 


174 


16 


Q8DF31 


Q8df31 vibrio vuln 


107 


5 


62 . 


5 


178 


12 


Q98Y37 


Q98y37 reptilian p 


108 


5 


62 . 


5 


178 


12 


Q994S6 


Q994s6 snake atcc- 


109 


5 


62 . 


5 


178 


12 


Q98Y59 


Q98y59 reptilian p 


110 


5 


62 . 


5 


178 


12 


Q994S4 


Q994s4 snake atcc- 


111 


5 


62 . 


5 


178 


12 


Q98Y62 


Q98y62 reptilian p 


112 


5 


62 . 


5 


178 


12 


Q91T11 


Q91tll reptilian p 


113 


5 


62 . 


5 


178 


16 


Q8A635 


Q8a635 bacteroides 


114 


5 


62 . 


5 


182 


2 


Q9S695 


Q9s 69 5 haemophilus 



115 


5 


62 . 


5 


182 


2 


Q9S694 


Q9s694 haemophilus 


116 


5 


62 . 


5 


182 


16 


025859 


025859 helicobacte 


117 


5 


62. 


5 


182 


16 


Q9Z JV8 


Q9zjv8 helicobacte 


118 


5 


62 . 


5 


183 


10 


Q7XWX7 


Q7xwx7 oryza sativ 


119 


5 


62 . 


5 


183 


16 


Q8NPK2 


Q8npk2 corynebacte 


120 


5 


62 . 


5 


184 


2 


Q9S6A0 


Q9s6a0 haemophilus 


121 


5 


62 . 


5 


184 


4 


Q9H875 


Q9h875 homo sapien 


122 


5 


62 . 


5 


185 


2 


Q9S697 


Q9s697 haemophilus 


123 


5 


62 . 


5 


185 


2 


Q9S698 


Q9s698 haemophilus 


124 


5 


62 . 


5 


186 


2 


Q9S6A2 


Q9s6a2 haemophilus 


125 


5 


62 . 


5 


186 


11 


Q9CWV6 


Q9 cwv6 mus mus culu 


126 


5 


62 . 


5 


186 


11 


Q9CY32 


Q9cy32 mus mus culu 


127 


5 


62 . 


5 


187 


12 


Q7TLV5 


Q7tlv5 choristoneu 


128 


5 


62 . 


5 


187 


16 


Q9RGV1 


Q9rgvl salmonella 


129 


5 


62 . 


5 


188 


2 


068806 


068806 synechococc 


130 


5 


62 . 


5 


188 


2 


Q9S6A6 


Q9s6a6 haemophilus 


131 


5 


62. 


5 


188 


2 


Q9S6A8 


Q9s6a8 haemophilus 


132 


5 


62 . 


5 


188 


2 


Q9S6A9 


Q9s6a9 haemophilus 


133 


5 


62 . 


5 


188 


2 


Q9S6A3 


Q9s6a3 haemophilus 


134 


5 


62 . 


5 


188 


2 


Q9S6A5 


Q9s6a5 haemophilus 


135 


5 


62 . 


5 


188 


16 


Q88DZ7 


Q8 8dz7 pseudomonas 


136 


5 


62 . 


5 


189 


2 


Q9S6A7 


Q9s6a7 haemophilus 


137 


5 


62. 


5 


189 


2 


Q9S6A4 


Q9s6a4 haemophilus 


138 


5 


62 . 


5 


189 


2 


Q9S6A1 


Q9s6al haemophilus 


139 


5 


62 . 


5 


192 


4 


Q8WV02 


Q8wv02 homo sapien 


140 


5 


62 . 


5 


192 


4 


Q8WVE9 


Q8wve9 homo sapien 


141 


5 


62 . 


5 


193 


2 


Q9X4F4 


Q9x4f4 haemophilus 


142 


5 


62 . 


5 


193 


16 


Q9A0N1 


Q9a0nl streptococc 


143 


5 


62 . 


5 


193 


16 


Q8NZS3 


Q8nzs3 streptococc 


144 


5 


62 . 


5 


193 


16 


Q938K4 


Q938k4 streptococc 


145 


5 


62. 


5 


195 


2 


Q9S693 


Q9s693 haemophilus 


146 


5 


62 . 


5 


198 


2 


Q8KJS9 


Q8kjs9 pseudomonas 


147 


5 


62 . 


5 


200 


16 


Q8ZLC5 


Q8zlc5 salmonella 


148 


5 


62 . 


5 


200 


16 


Q8 8HM1 


Q8 8hml pseudomonas 


149 


5 


62 . 


5 


200 


16 


Q8Z280 


Q8z280 salmonella 


150 


5 


62 . 


5 


203 


16 


Q7UV97 


Q7uv97 rhodopirell 


151 


5 


62 . 


5 


204 


13 


Q9I9D9 


Q9i9d9 xenopus lae 


152 


5 


62 . 


5 


205 


2 


Q9RNG7 


Q9rng7 legionella 


153 


5 


62 . 


5 


206 


16 


Q82I89 


Q82189 streptomyce 


154 


5 


62 . 


5 


208 


13 


Q7SZC6 


Q7szc6 gallus gall 


155 


5 


62 . 


5 


208 


16 


Q92WD5 


Q92wd5 rhizobium m 


156 


5 


62 . 


5 


208 


17 


Q8TUQ3 


Q8 tuq3 methanosar c 


157 


5 


62 . 


5 


209 


2 


Q8GMR9 


Q8gmr9 synechococc 


158 


5 


62 . 


5 


209 


2 


Q8GFP4 


Q8gfp4 citrobacter 


159 


5 


62 . 


5 


209 


5 


Q9U504 


Q9u504 manduca sex 


160 


5 


62 . 


5 


210 


2 


Q8RLA9 


Q8rla9 lactobacill 


161 


5 


62 . 


5 


210 


16 


Q88YJ6 


Q88yj6 lactobacill 


162 


5 


62. 


5 


211 


16 


Q8EQ35 


Q8eq35 oceanobacil 


163 


5 


62. 


5 


212 


2 


Q9FA34 


Q9fa34 salmonella 


164 


5 


62 . 


5 


212 


2 


Q9K2H0 


Q9k2h0 streptococc 


165 


5 


62 . 


5 


212 


16 


Q9RT20 


Q9rt2 0 deino coccus 


166 


5 


62 . 


5 


212 


16 


Q8EEW7 


Q8eew7 shewanella 


167 


5 


62 . 


5 


212 


17 


Q8PX99 


Q8px99 methanosarc 


168 


5 


62 . 


5 


214 


10 


Q7XVQ9 


Q7xvq9 oryza sativ 


169 


5 


62 . 


5 


214 


16 


Q9ABK9 


Q9abk9 caulobacter 


170 


5 


62 . 


5 


216 


10 


Q9FY89 


Q9f y8 9 arabidopsis 


171 


5 


62 . 


5 


217 


16 


Q8XBX3 


Q8xbx3 escherichia 



172 


5 


62 . 


5 


217 


16 


Q8Z8M5 


Q8z8m5 salmonella 


173 


5 


62 . 


5 


217 


16 


Q7UDG3 


Q7udg3 shigella fl 


174 


5 


62 . 


5 


219 


2 


Q9X5A5 


Q9x5a5 treponema d 


175 


5 


62 . 


5 


222 


16 


Q9HWM7 


Q9hwm7 pseudomonas 


176 


5 


62 . 


5 


226 


17 


Q8TP55 


Q8tp55 methanosarc 


177 


5 


62 . 


5 


227 


2 


087933 


087933 streptococc 


178 


5 


62 . 


5 


227 


16 


Q8R798 


Q8r798 thermoanaer 


179 


5 


62 . 


5 


228 


5 


Q8 6ML6 


Q86ml6 drosophila 


180 


5 


62 . 


5 


228 


8 


Q7Y8X0 


Q7y8x0 portunus tr 


181 


5 


62 . 


5 


228 


10 


Q7XKU0 


Q7xku0 oryza sativ 


182 


5 


62 . 


5 


228 


16 


Q97MP4 


Q97mp4 Clostridium 


183 


5 


62 . 


5 


230 


16 


Q8DCR4 


Q8dcr4 vibrio vuln 


184 


5 


62 . 


5 


231 


16 


Q98HL6 


Q98hl6 rhizobium 1 


185 


5 


62 . 


5 


232 


16 


Q9ZHS1 


Q9zhsl brucella ab 


186 


5 


62 . 


5 


232 


16 


Q8FZ93 


Q8fz93 brucella su 


187 


5 


62 . 


5 


232 


17 


Q8TJF1 


Q8tjfl methanosarc 


188 


5 


62 . 


5 


233 


2 


Q84DC5 


Q84dc5 pseudomonas 


189 


5 


62 . 


5 


233 


9 


080122 


080122 bacteriopha 


190 


5 


62 . 


5 


233 


16 


Q9FD95 


Q9fd95 rhizobium m 


191 


5 


62 . 


5 


234 


16 


Q8UCP9 


Q8ucp9 agrobacteri 


192 


5 


62 . 


5 


235 


10 


Q8S3V0 


Q8s3v0 sandersonia 


193 


5 


62. 


5 


235 


16 


Q87AL8 


Q87al8 xylella fas 


194 


5 


62 . 


5 


235 


16 


Q7UII5 


Q7uii5 rhodopirell 


195 


5 


62 . 


5 


237 


2 


Q9RNH4 


Q9rnh4 rhodobacter 


196 


5 


62 . 


5 


237 


10 


Q9FMM5 


Q9fmm5 arabidopsis 


197 


5 


62 . 


5 


237 


17 


Q97UC2 


Q97uc2 sulfolobus 


198 


5 


62 . 


5 


238 


16 


Q87L39 


Q87139 vibrio para 


199 


5 


62 . 


5 


239 


16 


Q92L63 


Q92163 rhizobium m 


200 


5 


62 . 


5 


239 


16 


Q7UFQ9 


Q7ufq9 rhodopirell 


201 


5 


62 . 


5 


239 


16 


Q7TXW2 


Q7txw2 mycobacteri 


202 


5 


62 . 


5 


240 


16 


Q9KV03 


Q9kv03 vibrio chol 


203 


5 


62 . 


5 


240 


16 


Q81VD4 


Q81vd4 bacillus an 


204 


5 


62 . 


5 


240 


16 


Q81IR5 


Q81ir5 bacillus ce 


205 


5 


62 . 


5 


242 


16 


Q988K7 


Q988K7 rhizobium 1 


206 


5 


62 . 


5 


246 


16 


Q8UA4 4 


Q8ua44 agrobacteri 


207 


5 


62 . 


5 


251 


16 


Q7UYX9 


Q7uyx9 rhodopirell 


208 


5 


62 . 


5 


252 


2 


Q44226 


Q4 422 6 anabaena sp 


209 


5 


62 . 


5 


254 


16 


Q8YKI8 


Q8yki8 anabaena sp 


210 


5 


62 . 


5 


254 


16 


Q8E6W0 


Q8e6w0 streptococc 


211 


5 


62 . 


5 


256 


16 


Q8XID9 


Q8xid9 Clostridium 


212 


5 


62 . 


5 


257 


11 


Q8C270 


Q8c270 mus musculu 


213 


5 


62 . 


5 


258 


2 


Q9ZB90 


Q9zb90 mycobacteri 


214 


5 


62 . 


5 


259 


10 


Q9ZS05 


Q9zs05 arabidopsis 


215 


5 


62 . 


5 


259 


11 


Q9D5K1 


Q9d5kl mus musculu 


216 


5 


62 . 


5 


259 


13 


Q7SYP7 


Q7syp7 xenopus lae 


217 


5 


62 . 


5 


2 62 


2 


Q50381 


Q50381 mycobacteri 


218 
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Q8vus9 paracoccus 


443 


5 


62 . 


5 


486 


11 


Q8R584 


Q8r584 mus musculu 


444 


5 


62 . 


5 


490 


16 


Q9A2Z5 


Q9a2 z5 caulobacter 


445 


5 


62 . 


5 


490 


16 


Q8EE98 


Q8ee98 shewanella 


446 


5 


62 . 


5 


493 


5 


Q8T8T5 


Q8t8t5 drosophila 


447 


5 


62 . 


5 


493 


10 


Q9LRA5 


Q91ra5 arabidopsis 


448 


5 


62 . 


5 


493 


10 


Q 8 LMT 0 


Q81mt0 oryza sativ 


449 


5 


62. 


5 


494 


17 


029412 


02 9412 archaeoglob 


450 


5 


62. 


5 


495 


11 


Q8BZE5 


Q8bze5 mus musculu 


451 


5 


62 . 


5 


497 


5 


Q18516 


Q18516 caenorhabdi 


452 


5 


62 . 


5 


499 


16 


Q55373 


Q55373 synechocyst 


453 


5 


62 . 


5 


500 


17 


028844 


02 8 84 4 archaeoglob 


454 


5 


62 . 


5 


504 


10 


Q9S9Z1 


Q9s 9zl arabidopsis 


455 


5 


62 . 


5 


504 


11 


Q9JM73 


Q9jm73 mus musculu 


456 


5 


62 . 


5 


507 


10 


Q8LMJ9 


Q81mj9 oryza sativ 



457 


5 


62. 


5 


507 


16 


Q8 2CV0 


Q82cv0 streptomyce 


458 


5 


62 . 


5 


509 


9 


Q9ZX71 


Q9zx71 mycobacteri 


459 


5 


62 . 


5 


510 


11 


Q8CFS9 


Q8cfs9 mus musculu 


460 


5 


62 . 


5 


511 


16 


Q8G5R8 


Q8g5r8 bifidobacte 


461 


5 


62 . 


5 


514 


2 


Q8KJG3 


Q8kjg3 rhizobium 1 


462 


5 


62. 


5 


514 


16 


Q98A94 


Q98a94 rhizobium 1 


463 


5 


62 . 


5 


518 


16 


Q7V4Z0 


Q7v4z0 prochloroco 


464 


5 


62 . 


5 


524 


16 


Q9ADN6 


Q9adn6 streptomyce 


465 


5 


62 . 


5 


526 


2 


Q9RQ03 


Q9rq03 xanthobacte 


466 


5 


62 . 


5 


527 


16 


Q894S3 


Q8 94s3 Clostridium 


467 


5 


62 . 


5 


530 


16 


Q9CIK5 


Q9cik5 lactococcus 


468 


5 


62 . 


5 


532 


3 


Q8NK00 


Q8nk00 trichoderma 


469 


5 


62. 


5 


533 


11 


Q8BQL2 


Q8bql2 mus musculu 


470 


5 


62 . 


5 


533 


16 


Q98F95 


Q98f95 rhizobium 1 


471 


5 


62 . 


5 


533 


16 


Q8FZ18 


Q8fzl8 brucella su 


472 


5 


62 . 


5 


538 


16 


Q8YIU3 


Q8yiu3 brucella me 


473 


5 


62 . 


5 


539 


4 


Q9NW4 5 


Q9nw45 homo sapien 


474 


5 


62 . 


5 


539 


4 


Q9NWD6 


Q9nwd6 homo sapien 


475 


5 


62 . 


5 


541 


16 


Q8G3E2 


Q8g3e2 brucella su 


476 


5 


62 . 


5 


541 


16 


Q7VA17 


Q7val7 prochloroco 


477 


5 


62 . 


5 


545 


10 


Q8S9K1 


Q8s9kl arabidopsis 


478 


5 


62 . 


5 


546 


16 


Q9AA4 5 


Q9aa45 caulobacter 


479 


5 


62 . 


5 


548 


17 


Q9HJZ5 


Q9hjz5 thermoplasm 


480 


5 


62 . 


5 


550 


16 


Q8YEE0 


Q8yee0 brucella me 


481 


5 


62 . 


5 


554 


10 


Q9S9X4 


Q9s9x4 arabidopsis 


482 


5 


62 . 


5 


555 


5 


Q20402 


Q2 04 02 caenorhabdi 


483 


5 


62 . 


5 


555 


11 


Q8CDY1 


Q8cdyl mus musculu 


484 


5 


62 . 


5 


556 


4 


Q96E57 


Q96e57 homo sapien 


485 


5 


62 . 


5 


556 


5 


015929 


015 92 9 Plasmodium 


486 


5 


62 . 


5 


557 


13 


Q90Z52 


Q90z52 brachydanio 


487 


5 


62 . 


5 


557 


16 


Q8D4Y0 


Q8d4y0 vibrio vuln 


488 


5 


62 . 


5 


560 


16 


Q8F6P3 


Q8f6p3 leptospira 


489 


5 


62 . 


5 


561 


5 


Q95P96 


Q95p96 patinopecte 


490 


5 


62 . 


5 


567 


4 


Q8N8E3 


Q8n8e3 homo sapien 


491 


5 


62 . 


5 


570 


5 


Q9U2R1 


Q9u2 r 1 caenorhabdi 


492 


5 


62 . 


5 


570 


11 


Q8CIH1 


Q8cihl mus musculu 


493 


5 


62 . 


5 


570 


11 


Q8QZX2 


Q8qzx2 mus musculu 


494 


5 


62 . 


5 


570 


16 


Q97IV7 


Q97iv7 Clostridium 


495 


5 


62 . 


5 


571 


4 


Q9BSW6 


Q9bsw6 homo sapien 


496 


5 


62 . 


5 


572 


5 


Q9XXH4 


Q9xxh4 caenorhabdi 


497 


5 


62 . 


5 


579 


4 


Q9H2N2 


Q9h2n2 homo sapien 


498 


5 


62 . 


, 5 


579 


17 


Q9V1X6 


Q9vlx6 pyrococcus 


499 


5 


62 . 


, 5 


580 


17 


059381 


059381 pyrococcus 


500 


5 


62 . 


.5 


581 


16 


Q9ADI5 


Q9adi5 streptomyce 


501 


5 


62 . 


. 5 


581 


16 


Q82L53 


Q82153 streptomyce 


502 


5 


62 . 


, 5 


582 


16 


033678 


033678 rhizobium m 


503 


5 


62 . 


. 5 


586 


10 


Q93XM8 


Q93xm8 malus domes 


504 


5 


62 . 


. 5 


591 


11 


Q80XI7 


Q80xi7 mus musculu 


505 


5 


62 . 


. 5 


596 


2 


Q56360 


Q56360 thiocapsa r 


506 


5 


62 , 


.5 


597 


10 


081103 


O81103 prunus arme 


507 


5 


62 . 


. 5 


597 


16 


Q9PPX0 


Q9ppx0 ureaplasma 


508 


5 


62 . 


. 5 


600 


5 


Q9V7T7 


Q9v7t7 drosophila 


509 


5 


62 . 


.5 


604 


5 


Q8IGT9 


Q8igt9 drosophila 


510 


5 


62 , 


. 5 


605 


16 


Q8DGW0 


Q8dgw0 synechococc 


511 


5 


62 , 


. 5 


607 


16 


Q9F2E0 


Q9f2e0 salmonella 


512 


5 


62 


.5 


607 


16 


Q9F2I6 


Q9f2i6 salmonella 


513 


5 


62 


. 5 


607 


16 


Q8Z JR3 


Q8zjr3 yersinia pe 



514 


5 


62 . 


5 


607 


16 


Q83SU6 


Q83su6 salmonella 


515 


5 


62 . 


5 


608 


10 


Q9FFD4 


Q9ffd4 arabidopsis 


516 


5 


62 . 


5 


611 


5 


Q23454 


Q23454 caenorhabdi 


517 


5 


62 . 


5 


614 


4 


Q8WU94 


Q8wu94 homo sapien 


518 


5 


62 . 


5 


614 


4 


Q96SR8 


Q96sr8 homo sapien 


519 


5 


62 . 


5 


615 


16 


Q8UAJ0 


Q8uaj0 agrobacteri 


520 


5 


62 . 


5 


616 


5 


Q8IQZ8 


Q8iqz8 drosophila 


521 


5 


62 . 


5 


618 


3 


Q96TI3 


Q96ti3 lentinula e 


522 


5 


62 . 


5 


619 


10 


Q9M648 


Q9m648 arabidopsis 


523 


5 


62 . 


5 


619 


10 


Q9C9W9 


Q9c9w9 arabidopsis 


524 


5 


62 . 


5 


622 


2 


Q51862 


Q518 62 pseudomonas 


525 


5 


62 . 


5 


622 


16 


Q823G8 


Q823g8 chlamydophi 


526 


5 


62 . 


5 


623 


17 


Q8PXC9 


Q8pxc9 methanosarc 


527 


5 


62 . 


5 


625 


11 


Q8C6N7 


Q8c6n7 mus musculu 


528 


5 


62 . 


5 


626 


10 


Q9SW44 


Q9sw44 arabidopsis 


529 


5 


62 . 


5 


626 


16 


Q8ZCN4 


Q8zcn4 yersinia pe 


530 


5 


62 . 


5 


635 


11 


055002 


O55002 mus musculu 


531 


5 


62 . 


5 


635 


16 


Q93IU4 


Q93iu4 streptomyce 


532 


5 


62 . 


5 


637 


16 


Q55413 


Q55413 synechocyst 


533 


5 


62 . 


5 


637 


16 


Q8EFU5 


Q8efu5 shewanella 


534 


5 


62 . 


5 


639 


2 


Q9AI10 


Q9ail0 photorhabdu 


535 


5 


62 . 


5 


640 


5 


061604 


061604 drosophila 


536 


5 


62 . 


5 


640 


5 


Q9NAP5 


Q 9 nap 5 caenorhabdi 


537 


5 


62 . 


5 


641 


5 


Q8IQZ7 


Q8iqz7 drosophila 


538 


5 


62 . 


5 


644 


12 


Q90063 


Q90063 nudaurelia 


539 


5 


62 . 


5 


646 


5 


Q25017 


Q25017 helix asper 


540 


5 


62 . 


5 


650 


10 


Q9SGR9 


Q9sgr9 arabidopsis 


541 


5 


62 . 


5 


657 


11 


Q8CGJ9 


Q8cgj9 mus musculu 


542 


5 


62 . 


5 


658 


11 


Q8BFS4 


Q8bfs4 mus musculu 


543 


5 


62 . 


5 


663 


11 


070434 


070434 mus musculu 


544 


5 


62 . 


5 


668 


10 


Q9LSS0 


Q91ss0 arabidopsis 


545 


5 


62 . 


5 


672 


10 


Q9ZWB1 


Q9zwbl arabidopsis 


546 


5 


62 . 


5 


672 


10 


Q851R1 


Q851rl oryza sativ 


547 


5 


62 . 


5 


673 


4 


Q8IWK1 


Q8iwkl homo sapien 


548 


5 


62 . 


5 


680 


11 


055001 


055001 mus musculu 


549 


5 


62 . 


5 


680 


16 


Q89UJ2 


Q89uj2 bradyrhizob 


550 


5 


62. 


5 


686 


16 


Q92QA5 


Q92qa5 rhizobium m 


551 


5 


62 . 


5 


691 


10 


Q9FHX3 


Q9fhx3 arabidopsis 


552 


5 


62 . 


5 


694 


16 


Q8A3X0 


Q8a3x0 bacteroides 


553 


5 


62 . 


5 


695 


8 


Q8HSD0 


Q8hsd0 solanum hav 


554 


5 


62 . 


5 


697 


11 


Q8K4V6 


Q8k4v6 mus musculu 


555 


5 


62 . 


5 


697 


16 


Q8EB65 


Q8eb65 shewanella 


556 


5 


62 . 


5 


698 


5 


Q7YU82 


Q7yu82 drosophila 


557 


5 


62 . 


5 


705 


5 


Q7YYO0 


Q7yy00 cryptospori 


558 


5 


62 . 


5 


709 


16 


Q8P6C0 


Q8p6c0 xanthomonas 


559 


5 


62 . 


5 


713 


16 


Q8PHS3 


Q8phs3 xanthomonas 


560 


5 


62 . 


.5 


714 


16 


Q97FP1 


Q97fpl Clostridium 


561 


5 


62 . 


.5 


716 


10 


Q8H8C1 


Q8h8cl oryza sativ 


562 


5 


62. 


.5 


718 


10 


Q93ZR5 


Q93zr5 arabidopsis 


563 


5 


62 


.5 


718 


10 


Q9LSR8 


Q91sr8 arabidopsis 


564 


5 


62 


.5 


719 


5 


Q9VM92 


Q9vm92 drosophila 


565 


5 


62 


. 5 


725 


5 


Q86J51 


Q86j51 dictyosteli 


566 


5 


62 


. 5 


726 


2 


Q9LAE4 


Q91ae4 azospirillu 


567 


5 


62 


.5 


726 


10 


Q7Y1Z5 


Q7ylz5 arabidopsis 


568 


5 


62 


. 5 


738 


5 


015646 


015646 dictyosteli 


569 


5 


62 


.5 


738 


5 


Q86IQ2 


Q86iq2 dictyosteli 


570 


5 


62 


.5 


742 


2 


068598 


0685 9 8 pseudomonas 



571 


5 


62 .5 


742 


12 


Q9WAU9 


Q9wau9 broad bean 


572 


5 


62 . 5 


742 


12 


Q9WAU7 


Q9wau7 broad bean 


573 


5 


62 . 5 


742 


12 


Q9WAU8 


Q9wau8 broad bean 


574 


5 


62 .5 


742 


16 


Q9I527 


Q91527 pseudomonas 


575 


5 


62 . 5 


742 


16 


Q8EUW3 


Q8euw3 mycoplasma 


576 


5 


62 . 5 


749 


10 


Q9LE23 


Q91e23 oryza sativ 


577 


5 


62 . 5 


753 


16 


Q8Y5E3 


Q8y5e3 listeria mo 


578 


5 


62 . 5 


757 


10 


Q8RXQ7 


Q8rxq7 arabidopsis 


579 


5 


62 . 5 


763 


16 


Q8FMM3 


Q8fmm3 corynebacte 


580 


5 


62 . 5 


768 


10 


Q9LHU0 


Q91hu0 oryza sativ 


581 


5 


62 .5 


770 


16 


Q8U9Q3 


Q8u9q3 agrobacteri 


582 


5 


62 .5 


771 


13 


Q9I8K4 


Q9i8k4 rana catesb 


583 


5 


62 . 5 


771 


16 


Q89N45 


Q89n45 bradyrhizob 


584 


5 


62 . 5 


772 


5 


Q8IJD2 


Q8ijd2 Plasmodium 


585 


5 


62 . 5 


779 


3 


014325 


014325 schizosacch 


586 


5 


62 .5 


780 


10 


Q9MA68 


Q9ma68 arabidopsis 


587 


5 


62 .5 


780 


16 


Q8PI15 


Q8pil5 xanthomonas 


588 


5 


62 .5 


781 


16 


Q8P6S1 


Q8p6sl xanthomonas 


589 


5 


62 .5 


782 


5 


Q8WQC0 


Q8wqc0 caenorhabdi 


590 


5 


62 . 5 


783 


3 


014130 


014130 schizosacch 


591 


5 


62 . 5 


784 


16 


Q8PN94 


Q8pn94 xanthomonas 


592 


5 


62 .5 


788 


16 


Q8DR28 


Q8dr28 streptococc 


593 


5 


62 .5 


789 


2 


Q9F599 


Q9f599 agrobacteri 


594 


5 


62 .5 


790 


16 


Q9PF36 


Q9pf36 xylella fas 


595 


5 


62 . 5 


790 


16 


Q9KS24 


Q9ks24 vibrio chol 


596 


5 


62 . 5 


790 


16 


Q8 7AJ3 


Q87aj3 xylella fas 


597 


5 


62 .5 


791 


10 


Q94HY0 


Q94hy0 oryza sativ 


598 


5 


62 .5 


791 


10 


Q7XCS1 


Q7xcsl oryza sativ 


599 


5 


62 .5 


793 


12 


Q64787 


Q64787 avian adeno 


600 


5 


62 . 5 


796 


16 


Q9Z4W5 


Q9z4w5 streptomyce 


601 


5 


62 . 5 


799 


13 


Q7SYH9 


Q7syh9 gallus gall 


602 


5 


62 .5 


803 


5 


Q8IBJ5 


Q8ibj5 Plasmodium 


603 


5 


62.5 


805 


5 


Q8MLR1 


Q8mlrl drosophila 


604 


5 


62 . 5 


808 


10 


Q9LNV7 


Q91nv7 arabidopsis 


605 


5 


62 . 5 


815 


16 


Q9CPD4 


Q9cpd4 pasteurella 


606 


5 


62 . 5 


819 


5 


Q95Q49 


Q95q49 caenorhabdi 


607 


5 


62.5 


819 


16 


Q8ZIN6 


Q8zin6 yersinia pe 


608 


5 


62 . 5 


823 


10 


Q9MA7 0 


Q9ma70 arabidopsis 


609 


5 


62 . 5 


824 


3 


Q9Y7F1 


Q9y7fl aspergillus 


610 


5 


62 . 5 


824 


12 


012663 


012663 broad bean 


611 


5 


62 . 5 


827 


3 


043051 


043051 schizosacch 


612 


5 


62 . 5 


837 


5 


062329 


062329 caenorhabdi 


613 


5 


62 . 5 


841 


11 


Q8R2Z1 


Q8r2zl mus musculu 


614 


5 


62 . 5 


841 


11 


054999 


054999 mus musculu 


615 


5 


62 . 5 


841 


11 


P97484 


P97484 mus musculu 


616 


5 


62 . 5 


841 


16 


Q83HB7 


Q83hb7 tropheryma 


617 


5 


62 . 5 


841 


16 


Q83FM2 


Q8 3fm2 tropheryma 


618 


5 


62 . 5 


843 


5 


002264 


002264 caenorhabdi 


619 


5 


62 . 5 


845 


5 


Q8WPP8 


Q8wpp8 trypanosoma 


620 


5 


62 . 5 


845 


16 


Q8CZP2 


Q8czp2 yersinia pe 


621 


5 


62 . 5 


846 


16 


Q9PN76 


Q9pn7 6 campylobact 


622 


5 


62 . 5 


847 


16 


Q8EEC3 


Q8eec3 shewanella 


623 


5 


62 . 5 


849 


10 


Q9SHG6 


Q9shg6 arabidopsis 


624 


5 


62 . 5 


853 


11 


Q8C0R9 


Q8c0r9 mus musculu 


625 


5 


62 . 5 


860 


5 


Q9NAB5 


Q9nab5 caenorhabdi 


626 


5 


62 . 5 


863 


16 


Q7U078 


Q7u078 mycobacteri 


627 


5 


62. 5 


866 


5 


Q9VF2 0 


Q9vf20 drosophila 



628 


5 


62 . 


5 


870 


5 


P91346 


P91346 caenorhabdi 


62 9 


5 


62 . 


5 


874 


2 


Q9S3L0 


Q9s310 Clostridium 


630 


5 


62 . 


5 


877 


16 


086365 


086365 mycobacteri 


631 


5 


62 . 


5 


877 


16 


Q7U1Q1 


Q7ulql mycobacteri 


632 


5 


62. 


5 


882 


11 


Q9R206 


Q9r206 mus musculu 


633 


5 


62 . 


5 


882 


11 


070353 


070353 mus musculu 


634 


5 


62 . 


5 


884 


6 


097583 


097583 bos taurus 


635 


5 


62 . 


5 


884 


16 


Q8P5B4 


Q8p5b4 xanthomonas 


636 


5 


62 . 


5 


885 


16 


Q8PPJ3 


Q8pp j 3 xanthomonas 


637 


5 


62 . 


5 


886 


3 


Q875Q0 


Q875q0 saccharomyc 


638 


5 


62 . 


5 


894 


10 


Q8VYW8 


Q8vyw8 arabidopsis 


639 


5 


62 . 


5 


898 


5 


Q8SV30 


Q8sv30 encephalito 


640 


5 


62 . 


5 


904 


5 


Q8MLR0 


Q8mlr0 drosophila 


641 


5 


62 . 


5 


908 


10 


Q7XXE6 


Q7xxe6 oryza sativ 


642 


5 


62 . 


5 


911 


4 


Q7Z7D0 


Q7z7d0 homo sapien 


643 


5 


62 . 


5 


914 


5 


Q9W1L9 


Q9wll9 drosophila 


644 


5 


62 . 


5 


915 


16 


Q8EUK1 


Q8eukl mycoplasma 


645 


5 


62 . 


5 


917 


3 


Q8X032 


Q8x032 neurospora 


646 


5 


62 . 


5 


917 


4 


Q8NFN4 


Q8nfn4 homo sapien 


647 


5 


62 . 


5 


917 


4 


Q8WXA0 


Q8wxa0 homo sapien 


648 


5 


62 . 


5 


917 


4 


Q8IUQ1 


Q8iuql homo sapien 


649 


5 


62 . 


5 


922 


16 


Q89ST3 


Q89st3 bradyrhizob 


650 


5 


62. 


5 


930 


11 


Q925G9 


Q925g9 mus musculu 


651 


5 


62. 


5 


931 


16 


Q8RD56 


Q8 rd5 6 thermoanaer 


652 


5 


62 . 


5 


933 


5 


P91129 


P 9 112 9 caenorhabdi 


653 


5 


62 . 


5 


937 


12 


Q9Q2Q2 


Q9q2q2 broad bean 


654 


5 


62 . 


5 


937 


12 


Q9WDH0 


Q9wdh0 broad bean 


655 


5 


62. 


5 


955 


5 


Q19945 


Q19945 caenorhabdi 


656 


5 


62. 


5 


956 


16 


Q7WDA9 


Q7wda9 bordetella 


657 


5 


62 . 


5 


956 


16 


Q7VZ18 


Q7vzl8 bordetella 


658 


5 


62 . 


5 


957 


16 


Q7W5S2 


Q7w5s2 bordetella 


659 


5 


62 . 


5 


967 


11 


Q920Q9 


Q920q9 rattus norv 


660 


5 


62 . 


5 


969 


17 


Q8U1U6 


Q8ulu6 pyrococcus 


661 


5 


62 . 


5 


976 


16 


Q9AB92 


Q9ab92 caulobacter 


662 


5 


62 . 


5 


985 


2 


P94324 


P9432 4 bradyrhizob 


663 


5 


62 . 


5 


985 


16 


Q89X63 


Q89x63 bradyrhizob 


664 


5 


62 . 


5 


989 


2 


Q8KTI7 


Q8kti7 methylobact 


665 


5 


62 . 


5 


991 


11 


Q8C0M0 


Q8c0m0 mus musculu 


666 


5 


62 . 


5 


997 


5 


Q967Z1 


Q967zl trypanosoma 


667 


5 


62 . 


5 


1001 


10 


Q8LSP2 


Q81sp2 oryza sativ 


668 


5 


62 . 


5 


1002 


10 


Q9ARQ7 


Q9arq7 oryza sativ 


669 


5 


62 . 


5 


1008 


16 


Q9PEJ2 


Q9pej2 xylella fas 


670 


5 


62 . 


5 


1008 


16 


Q87EI9 


Q87ei9 xylella fas 


671 


5 


62 . 


5 


1013 


17 


Q97C94 


Q97c94 thermoplasm 


672 


5 


62 . 


5 


1017 


5 


Q8IPL3 


Q8ipl3 drosophila 


673 


5 


62 . 


5 


1017 


10 


Q9LKV5 


Q91kv5 arabidopsis 


674 


5 


62 . 


5 


1042 


5 


Q9W4W2 


Q9w4w2 drosophila 


675 


5 


62 . 


5 


1042 


5 


076911 


076911 drosophila 


676 


5 


62 . 


5 


1042 


5 


Q9UB2 0 


Q9ub20 drosophila 


677 


5 


62 . 


. 5 


1042 


16 


Q97G63 


Q97g63 Clostridium 


678 


5 


62 . 


. 5 


1047 


10 


Q9FY54 


Q9fy54 arabidopsis 


679 


5 


62 . 


. 5 


1053 


11 


Q80YP5 


Q80yp5 mus musculu 


680 


5 


62 , 


. 5 


1055 


3 


Q9C1J9 


Q9clj9 cryptococcu 


681 


5 


62 


.5 


1064 


12 


Q9IR41 


Q9ir41 broad bean 


682 


5 


62 


.5 


1064 


12 


Q9YM09 


Q9ym09 broad bean 


683 


5 


62 


.5 


1064 


12 


Q9WAU6 


Q9wau6 broad bean 


684 


5 


62 


. 5 


1064 


12 


Q9IZU4 


Q9izu4 broad bean 



685 


5 


62 . 


5 


1064 


12 


Q9Z006 


Q9z006 patchouli m 


686 


5 


62 . 


5 


1065 


12 


Q9Q2Q3 


Q9q2q3 broad bean 


687 


5 


62 . 


5 


1072 


16 


Q98LC1 


Q981cl rhizobium 1 


688 


5 


62 . 


5 


1093 


4 


Q8NFN5 


Q8nfn5 homo sapien 


689 


5 


62 . 


5 


1100 


10 


Q9M319 


Q9m319 arabidopsis 


690 


5 


62 . 


5 


1105 


11 


Q8BMJ4 


Q8bmj4 mus musculu 


691 


5 


62 . 


5 


1115 


4 


075189 


075189 homo sapien 


692 


5 


62 . 


5 


1115 


5 


017267 


0172 67 caenorhabdi 


693 


5 


62 . 


5 


1130 


5 


Q8ITY8 


Q8ity8 caenorhabdi 


694 


5 


62 . 


5 


1136 


16 


Q88SK2 


Q8 8sk2 lactobacill 


695 


5 


62 . 


5 


1140 


15 


Q8 J6D9 


Q8j6d9 simian foam 


696 


5 


62. 


5 


1143 


15 


012817 


012 817 human spuma 


697 


5 


62 . 


5 


1146 


15 


Q87040 


Q87040 simian foam 


698 


5 


62 . 


5 


1169 


11 


Q9QXH4 


Q9qxh4 mus musculu 


699 


5 


62 . 


5 


1178 


16 


Q7ZAL0 


Q7zal0 streptococc 


700 


5 


62 . 


5 


1179 


16 


Q97QG7 


Q97qg7 streptococc 


701 


5 


62 . 


5 


1181 


5 


Q9VBP3 


Q9vbp3 drosophila 


702 


5 


62 . 


5 


1181 


5 


Q9XZ37 


Q9xz37 drosophila 


703 


5 


62 . 


5 


1198 


4 


Q8IVE1 


Q8ivel homo sapien 


704 


5 


62 . 


5 


1200 


11 


Q8BGF7 


Q8bgf7 mus musculu 


705 


5 


62 . 


5 


1210 


10 


Q93ZM5 


Q9 3zm5 arabidopsis 


706 


5 


62 . 


5 


1241 


5 


Q9W3L1 


Q9w311 drosophila 


707 


5 


62 . 


5 


1243 


16 


Q9RDG5 


Q9rdg5 streptomyce 


708 


5 


62 . 


5 


1249 


16 


Q8REU9 


Q8reu9 fusobacteri 


709 


5 


62 . 


5 


1251 


10 


Q9MA34 


Q9ma34 arabidopsis 


710 


5 


62 . 


5 


1257 


16 


Q82CP4 


Q82cp4 streptomyce 


711 


5 


62 . 


5 


1275 


17 


Q8Q061 


Q8q061 methanosarc 


712 


5 


62 . 


5 


1292 


16 


Q87LC6 


Q871c6 vibrio para 


713 


5 


62 . 


5 


1301 


12 


Q91LK4 


Q911k4 white spot 


714 


5 


62 . 


5 


1301 


12 


Q918H8 


Q918h8 white spot 


715 


5 


62 . 


5 


1301 


16 


Q8 9W2 8 


Q8 9w2 8 bradyrhizob 


716 


5 


62. 


5 


1324 


16 


Q82VX1 


Q82vxl nitrosomona 


717 


5 


62 . 


5 


1327 


4 


Q9Y2L2 


Q9y212 homo sapien 


718 


5 


62 . 


5 


1330 


2 


Q55284 


Q55284 synechocyst 


719 


5 


62 . 


5 


1365 


2 


Q8KWD2 


Q8kwd2 ruegeria sp 


720 


5 


62 . 


5 


1386 


5 


Q9GRS8 


Q9grs 8 leishmania 


721 


5 


62. 


5 


1399 


5 


Q9V4R0 


Q9v4r0 drosophila 


722 


5 


62 . 


5 


1449 


5 


Q8T675 


Q8t675 dictyosteli 


723 


5 


62 . 


5 


1523 


4 


Q7Z3K4 


Q7z3k4 homo sapien 


724 


5 


62 . 


5 


1527 


5 


Q8I425 


Q81425 plasmodium 


725 


5 


62 . 


5 


1527 


16 


Q7V505 


Q7v505 prochloroco 


726 


5 


62 . 


5 


1529 


5 


Q9W4Y9 


Q9w4y9 drosophila 


727 


5 


62 . 


5 


1566 


5 


Q9XUF9 


Q9xuf9 caenorhabdi 


728 


5 


62 . 


5 


1569 


16 


Q88KK3 


Q88kk3 pseudomonas 


729 


5 


62 . 


5 


1623 


5 


Q9U3U7 


Q9u3u7 anopheles g 


730 


5 


62 . 


5 


1724 


5 


Q9N583 


Q9n583 caenorhabdi 


731 


5 


62 . 


5 


1758 


5 


Q95Y84 


Q95y84 caenorhabdi 


732 


5 


62 . 


5 


1773 


5 


Q8WPD8 


Q8wpd8 ciona intes 


733 


5 


62 . 


5 


1815 


16 


Q92VA6 


Q92va6 rhizobium m 


734 


5 


62 . 


5 


1852 


3 


013281 


013281 emericella 


735 


5 


62 . 


5 


1857 


3 


Q8TGC5 


Q8tgc5 aspergillus 


736 


5 


62 . 


5 


1932 


16 


Q7UQ94 


Q7uq94 rhodopirell 


737 


5 


62 . 


5 


2028 


8 


Q32064 


Q32 0 64 chlamydomon 


738 


5 


62 . 


5 


2102 


5 


Q86IW7 


Q86iw7 d similar t 


739 


5 


62 . 


5 


2190 


12 


Q915A5 


Q915a5 pichinde ar 


740 


5 


62 . 


5 


2195 


12 


Q8VAB2 


Q8vab2 white spot 


741 


5 


62 . 


5 


2351 


12 


Q91LL2 


Q91112 white spot 



742 


5 


62 . 


5 


2351 


12 


Q8QTG5 


Q8qtg5 white spot 


743 


5 


62 . 


5 


2364 


2 


Q46342 


Q4 6342 Clostridium 


744 


5 


62 . 


5 


2367 


10 


Q7X970 


Q7x970 oryza sativ 


745 


5 


62 . 


5 


2413 


2 


Q9FDB0 


Q9fdb0 vibrio chol 


746 


5 


62 . 


5 


2588 


11 


088491 


08 8 4 91 mus musculu 


747 


5 


62 . 


5 


2701 


4 


Q9Y520 


Q9y520 homo sapien 


748 


5 


62 . 


5 


2756 


2 


033904 


033904 shewanella 


749 


5 


62 . 


5 


2849 


9 


Q8W6K4 


Q8w6k4 sinorhizobi 


750 


5 


62 . 


5 


3063 


2 


Q59497 


Q594 97 corynebacte 


751 


5 


62 . 


5 


3522 


5 


Q86NF7 


Q8 6nf 7 caenorhabdi 


752 


5 


62 . 


5 


3563 


5 


Q95NZ2 


Q95nz2 caenorhabdi 


753 


5 


62 . 


5 


3600 


10 


Q9SA64 


Q9sa64 arabidopsis 


754 


5 


62 . 


5 


3685 


4 


Q14205 


Q14205 homo sapien 


755 


5 


62 . 


5 


3848 


2 


P94772 


P94772 erwinia chr 


756 


5 


62 . 


5 


4342 


16 


Q9I157 


Q9il57 pseudomonas 


757 


5 


62 . 


5 


4488 


11 


Q9QZH1 


Q9qzhl mus musculu 


758 


5 


62 . 


5 


4559 


5 


Q9U3Y5 


Q9u3y5 drosophila 


759 


5 


62 . 


5 


4666 


10 


Q7XQ92 


Q7xq92 oryza sativ 


760 


5 


62 . 


5 


4730 


5 


Q8T128 


Q8tl28 dictyosteli 


761 


5 


62 . 


5 


4801 


2 


Q9AIT3 


Q 9 ait 3 xanthomonas 


7 62 


5 


62 . 


5 


4918 


5 


Q867N2 


Q8 67 n2 caenorhabdi 


763 


5 


62 . 


5 


4929 


5 


Q867D9 


Q8 67d9 caenorhabdi 


764 


5 


62 . 


5 


4944 


5 


Q8 6NG0 


Q8 6ng0 caenorhabdi 


765 


5 


62 . 


5 


8601 


2 


Q8GM87 


Q8gm8 7 symbiont ba 


766 


5 


62 . 


5 


9477 


2 


Q9L4X3 


Q 91 4x3 s treptomyce 


767 


5 


62 . 


5 


9510 


2 


Q93NX9 


Q9 3nx9 s treptomyce 


768 


5 


62 . 


5 


20925 


3 


Q8NJX1 


Q8n j xl trichoderma 


769 


4 


50 . 


0 


9 


2 


P72149 


P72 14 9 pseudomonas 


770 


4 


50 . 


0 


17 


2 


P83329 


P83329 streptococc 


771 


4 


50 . 


0 


17 


11 


Q61932 


Q61932 mus musculu 


772 


4 


50 . 


0 


19 


4 


Q9UCC8 


Q9ucc8 homo sapien 


773 


4 


50 . 


0 


20 


2 


Q9R4E9 


Q9r4e9 rhodopseudo 


774 


4 


50. 


0 


26 


5 


Q9BM03 


Q9bm03 dugesia tig 


775 


4 


50 . 


0 


26 


7 


046850 


046850 triakis scy 


776 


4 


50 . 


0 


28 


4 


Q9UCD1 


Q9ucdl homo sapien 


111 


4 


50. 


0 


30 


4 


Q9UCA9 


Q9uca9 homo sapien 


IIS 


4 


50 . 


0 


30 


11 


Q9QV42 


Q9qv42 rattus sp . 


779 


4 


50 . 


0 


32 


2 


Q9R5Q7 


Q9r5q7 aeromonas h 


780 


4 


50 . 


0 


33 


4 


Q9UC33 


Q9uc33 homo sapien 


781 


4 


50 . 


0 


33 


5 


Q9GTB2 


Q9gtb2 eimeria ten 


782 


4 


50. 


0 


33 


5 


Q9GTC2 


Q9gtc2 Plasmodium 


783 


4 


50. 


0 


33 


5 


Q9GTA9 


Q9gta9 sarcocystis 


784 


4 


50 . 


0 


33 


5 


Q9GT95 


Q9gt95 cryptospori 


785 


4 


50 . 


0 


33 


5 


Q9GTA2 


Q9gta2 babesia bov 


786 


4 


50 . 


0 


33 


12 


089876 


089876 budgerigar 


787 


4 


50 . 


0 


33 


12 


089877 


089877 budgerigar 


788 


4 


50 . 


0 


33 


12 


089886 


089886 budgerigar 


789 


4 


50. 


0 


33 


12 


089885 


089885 budgerigar 


790 


4 


50 . 


0 


33 


12 


089871 


089871 budgerigar 


791 


4 


50 . 


0 


33 


12 


089881 


089881 budgerigar 


792 


4 


50 . 


0 


33 


12 


089874 


089874 budgerigar 


793 


4 


50 . 


0 


33 


12 


089884 


089884 budgerigar 


794 


4 


50 . 


0 


33 


12 


089882 


089882 budgerigar 


795 


4 


50. 


0 


33 


12 


089873 


089873 budgerigar 


796 


4 


50 . 


0 


33 


12 


089883 


089883 budgerigar 


797 


4 


50 . 


0 


33 


12 


089879 


089879 budgerigar 


798 


4 


50 . 


0 


33 


12 


089875 


089875 budgerigar 



799 


4 


50. 


0 


33 


12 


089872 


089872 budgerigar 


800 


4 


50. 


0 


33 


12 


089887 


089887 budgerigar 


801 


4 


50. 


0 


33 


12 


089880 


089880 budgerigar 


802 


4 


50. 


0 


33 


12 


089878 


089878 budgerigar 


803 


4 


50. 


0 


33 


16 


Q9PH79 


Q9ph79 xylella fas 


804 


4 


50. 


0 


33 


16 


Q97RJ3 


Q97rj3 streptococc 


805 


4 


50. 


0 


33 


16 


Q7VFH8 


Q7vfh8 helicobacte 


806 


4 


50. 


0 


35 


4 


Q8WZ99 


Q8wz99 homo sapien 


807 


4 


50. 


0 


36 


16 


Q8VJK4 


Q8vjk4 mycobacteri 


808 


4 


50. 


0 


36 


16 


Q83D85 


Q83d85 coxiella bu 


809 


4 


50. 


0 


39 


2 


Q9R891 


Q9r891 chlamydia t 


810 


4 


50. 


0 


40 


2 


068327 


068327 caulobacter 


811 


4 


50. 


0 


40 


16 


Q8ECU5 


Q8ecu5 shewanella 


812 


4 


50. 


0 


41 


2 


Q9RZX2 


Q9rzx2 borrelia bu 


813 


4 


50. 


0 


41 


6 


Q9N194 


Q9nl 94 macaca mula 


814 


4 


50. 


0 


41 


6 


Q9N191 


Q9nl91 hylobates 1 


815 


4 


50. 


0 


41 


6 


Q9N193 


Q9nl93 gorilla gor 


816 


4 


50. 


0 


41 


6 


Q9N192 


Q9nl92 pan troglod 


817 


4 


50. 


0 


41 


16 


Q8PGA4 


Q8pga4 xanthomonas 


818 


4 


50. 


0 


41 


16 


Q8F414 


Q8f414 leptospira 


819 


4 


50. 


0 


42 


2 


Q53299 


Q53299 escherichia 


820 


4 


50. 


0 


42 


12 


Q9YPJ0 


Q9ypj0 avian infec 


821 


4 


50. 


0 


42 


15 


Q9Q582 


Q9q582 human immun 


822 


4 


50. 


0 


44 


2 


Q47324 


Q47324 escherichia 


823 


4 


50. 


0 


44 


16 


Q99QL7 


Q99ql7 streptomyce 


824 


4 


50. 


0 


44 


16 


Q8E9C1 


Q8e9cl shewanella 


825 


4 


50. 


0 


44 


16 


Q8DAR5 


Q8dar5 vibrio vuln 


826 


4 


50. 


0 


44 


16 


Q7UQP3 


Q7uqp3 rhodopirell 


827 


4 


50. 


0 


45 


4 


Q9HCW4 


Q9hcw4 homo sapien 


828 


4 


50. 


0 


46 


16 


Q9A039 


Q9a039 streptococc 


829 


4 


50. 


0 


47 


10 


Q7XS03 


Q7xs03 oryza sativ 


830 


4 


50. 


0 


48 


12 


Q85243 


Q85243 potato viru 


831 


4 


50. 


0 


49 


7 


Q9GIQ2 


Q9giq2 eschrichtiu 


832 


4 


50 . 


0 


49 


16 


Q8KF81 


Q8kf81 chlorobium 


833 


4 


50. 


0 


50 


5 


Q8T642 


Q8t642 ceratitis c 


834 


4 


50. 


0 


50 


5 


Q8T643 


Q8t64 3 ceratitis c 


835 


4 


50. 


0 


50 


12 


Q91FI6 


Q91fi6 chilo iride 


836 


4 


50. 


0 


50 


12 


Q85239 


Q85239 potato viru 


837 


4 


50. 


0 


50 


13 


Q08135 


Q08135 pseudopleur 


838 


4 


50. 


0 


50 


16 


Q8X453 


Q8x453 escherichia 


839 


4 


50. 


0 


50 


16 


Q8FF68 


Q8f f 68 escherichia 


840 


4 


50. 


0 


51 


2 


Q51106 


Q51106 neisseria m 


841 


4 


50. 


0 


51 


2 


Q9F8M3 


Q9f8m3 carboxydoth 


842 


4 


50 . 


0 


51 


5 


Q26905 


Q2 69 05 trichostron 


843 


4 


50. 


0 


51 


16 


Q8RAW6 


Q8raw6 thermoanaer 


844 


4 


50. 


0 


51 


16 


Q8G1Q7 


Q8glq7 brucella su 


845 


4 


50. 


0 


51 


16 


Q8FVK0 


Q8fvk0 brucella su 


846 


4 


50 . 


0 


52 


7 


Q9XR67 


Q9xr67 oncorhynchu 


847 


4 


50. 


0 


52 


7 


Q9XR64 


Q9xr64 oncorhynchu 


848 


4 


50. 


0 


52 


7 


Q9XR66 


Q9xr66 oncorhynchu 


849 


4 


50 . 


0 


52 


7 


Q9XR65 


Q9xr65 oncorhynchu 


850 


4 


50 . 


0 


53 


16 


Q89T69 


Q89t69 bradyrhizob 


851 


4 


50. 


0 


53 


16 


Q81TE4 


Q81te4 bacillus an 


852 


4 


50. 


0 


53 


16 


Q81G86 


Q81g86 bacillus ce 


853 


4 


50 . 


0 


53 


16 


Q7UMV1 


Q7umvl rhodopirell 


854 


4 


50 . 


0 


54 


16 


Q99ZY5 


Q99zy5 streptococc 


855 


4 


50. 


0 


54 


16 


Q8DTS1 


Q8dtsl streptococc 



856 


4 


50. 


0 


55 


2 


Q9XC71 


Q9xc71 salmonella 


857 


4 


50 . 


0 


55 


10 


Q38994 


Q3 8 994 arabidopsis 


858 


4 


50 . 


0 


57 


2 


Q8RIW6 


Q8riw6 streptococc 


859 


4 


50 . 


0 


57 


7 


Q29998 


Q29998 homo sapien 


860 


4 


50 . 


0 


57 


7 


Q29997 


Q29997 homo sapien 


861 


4 


50. 


0 


57 


10 


Q38993 


Q3 8 993 arabidopsis 


862 


4 


50 . 


0 


57 


12 


Q91TH0 


Q91th0 tupaia herp 


863 


4 


50 . 


0 


58 


17 


Q977A3 


Q977a3 sulfolobus 


864 


4 


50 . 


0 


59 


9 


Q855I4 


Q85514 mycobacteri 


865 


4 


50 . 


0 


59 


10 


Q8GRR3 


Q8grr3 oryza sativ 


866 


4 


50 . 


0 


60 


16 


Q8FUJ6 


Q8fuj6 corynebacte 


867 


4 


50 . 


0 


61 


2 


Q56859 


Q56859 yersinia en 


868 


4 


50. 


0 


61 


3 


Q9UTY5 


Q9uty5 schizosacch 


869 


4 


50. 


0 


61 


9 


Q857Y1 


Q857yl mycobacteri 


870 


4 


50. 


0 


61 


16 


Q9KPP8 


Q9kpp8 vibrio chol 


871 


4 


50. 


0 


62 


5 


Q8MUB1 


Q8mubl mesobuthus 


872 


4 


50. 


0 


62 


5 


Q7YVW6 


Q7yvw6 trypanosoma 


873 


4 


50 . 


0 


62 


5 


Q7YSI4 


Q7ysi4 trypanosoma 


874 


4 


50 . 


0 


62 


10 


Q9XF78 


Q9xf78 oryza sativ 


875 


4 


50 . 


0 


62 


10 


Q39072 


Q3 9072 arabidopsis 


876 


4 


50 . 


0 


62 


16 


Q8NPR0 


Q8npr0 corynebacte 


877 


4 


50 . 


0 


62 


16 


Q7WKU6 


Q7wku6 bordetella 


878 


4 


50. 


0 


62 


16 


Q7W7F8 


Q7w7f8 bordetella 


879 


4 


50. 


0 


62 


16 


Q7WB1 


Q7vvbl bordetella 


880 


4 


50. 


0 


63 


2 


Q9RP37 


Q9rp37 mycobacteri 


881 


4 


50 . 


0 


63 


2 


Q93PX9 


Q93px9 pseudomonas 


882 


4 


50 . 


0 


63 


17 


Q8ZY13 


Q8zyl3 pyrobaculum 


883 


4 


50 . 


0 


64 


2 


Q8KK86 


Q8kk86 proteus vul 


884 


4 


50 . 


0 


64 


9 


Q8HAG9 


Q8hag9 salmonella 


885 


4 


50. 


0 


65 


8 


Q8 5MU1 


Q85mul homo sapien 


886 


4 


50 . 


0 


65 


16 


Q8YGX3 


Q8ygx3 brucella me 


887 


4 


50. 


0 


65 


16 


Q81Q04 


Q81q04 bacillus an 


888 


4 


50 . 


0 


66 


2 


P97017 


P97017 aeromonas j 


889 


4 


50 . 


0 


66 


10 


Q7XNZ4 


Q7xnz4 oryza sativ 


890 


4 


50 . 


0 


66 


12 


Q8QIA7 


Q8qia7 hepatitis c 


891 


4 


50. 


0 


66 


12 


Q84302 


Q8 4 3 02 human papil 


892 


4 


50 . 


0 


66 


12 


Q80DG6 


Q80dg6 hepatitis c 


893 


4 


50 . 


0 


66 


12 


Q80D76 


Q80d76 hepatitis c 


894 


4 


50 . 


0 


66 


16 


Q92ZR2 


Q92zr2 rhizobium m 


895 


4 


50 . 


0 


66 


16 


Q8YJG6 


Q8yjg6 brucella me 


896 


4 


50. 


0 


66 


16 


050486 


050486 streptomyce 


897 


4 


50 . 


0 


66 


16 


Q8G0Y9 


Q8g0y9 brucella su 


898 


4 


50. 


0 


67 


2 


Q9L8X0 


Q918x0 streptomyce 


899 


4 


50. 


0 


67 


2 


Q9L8X2 


Q918x2 streptomyce 


900 


4 


50. 


0 


67 


10 


Q8L9S8 


Q819s8 arabidopsis 


901 


4 


50 . 


0 


67 


16 


Q8XBH8 


Q8xbh8 escherichia 


902 


4 


50 . 


0 


67 


16 


Q7UGP7 


Q7ugp7 rhodopirell 


903 


4 


50. 


0 


68 


2 


Q52624 


Q52624 proteus vul 


904 


4 


50. 


0 


68 


5 


Q9TXY3 


Q9txy3 caenorhabdi 


905 


4 


50 . 


0 


68 


5 


Q9BL96 


Q9bl96 caenorhabdi 


906 


4 


50 . 


0 


68 


16 


Q9KN04 


Q9kn04 vibrio chol 


907 


4 


50. 


0 


68 


16 


Q8R6R7 


Q8 r6r7 thermoanaer 


908 


4 


50 . 


0 


69 


12 


Q8VAI7 


Q8vai7 white spot 


909 


4 


50 . 


0 


69 


16 


Q8XPH7 


Q8xph7 ralstonia s 


910 


4 


50 . 


0 


69 


16 


Q8DFY4 


Q8df y4 vibrio vuln 


911 


4 


50. 


0 


69 


16 


Q88KL5 


Q8 8kl5 pseudomonas 


912 


4 


50. 


0 


70 


12 


Q85247 


Q85247 potato viru 



913 


4 


50 . 


0 


70 


12 


Q98622 


Q98622 potato viru 


914 


4 


50 . 


0 


70 


16 


Q8PJ81 


Q8pj81 xanthomonas 


915 


4 


50 . 


0 


70 


16 


Q83FD3 


Q83fd3 coxiella bu 


916 


4 


50 . 


0 


70 


17 


Q8PT31 


Q8pt31 methanosarc 


917 


4 


50 . 


0 


71 


2 


Q92N67 


Q9zn67 ruminobacte 


918 


4 


50. 


0 


71 


2 


Q93PT2 


Q93pt2 lactococcus 


919 


4 


50. 


0 


71 


9 


Q854I4 


Q85414 mycobacteri 


920 


4 


50. 


0 


71 


16 


Q8A4Q1 


Q8a4ql bacteroides 


921 


4 


50. 


0 


72 


5 


Q8IDT7 


Q8idt7 plasmodium 


922 


4 


50. 


0 


72 


9 


Q8HA64 


Q8ha64 bacteriopha 


923 


4 


50 . 


0 


72 


9 


Q8HA55 


Q8ha55 bacteriopha 


924 


4 


50 . 


0 


72 


9 


Q855R4 


Q855r4 mycobacteri 


925 


4 


50. 


0 


72 


9 


Q855G5 


Q855g5 mycobacteri 


926 


4 


50. 


0 


72 


12 


Q9YMU8 


Q9ymu8 lymantria d 


927 


4 


50. 


0 


72 


16 


Q99TW2 


Q99tw2 staphylococ 


928 


4 


50. 


0 


72 


16 


Q8NWD2 


Q8nwd2 staphylococ 


929 


4 


50. 


0 


72 


16 


Q8K7S9 


Q8k7s9 streptococc 


930 


4 


50. 


0 


73 


4 


095641 


095641 homo sapien 


931 


4 


50 . 


0 


73 


9 


Q855W3 


Q855w3 mycobacteri 


932 


4 


50 . 


0 


73 


10 


Q8GX82 


Q8gx82 arabidopsis 


933 


4 


50 . 


0 


73 


16 


Q8U8E9 


Q8u8e9 agrobacteri 


934 


4 


50. 


0 


73 


16 


Q8X2G0 


Q8x2g0 escherichia 


935 


4 


50. 


0 


74 


2 


Q9S460 


Q9s460 streptococc 


936 


4 


50. 


0 


74 


2 


Q83WF4 


Q83wf4 micromonosp 


937 


4 


50. 


0 


74 


16 


Q92A64 


Q92a64 listeria in 


938 


4 


50 . 


0 


74 


16 


Q8Y5W1 


Q8y5wl listeria mo 


939 


4 


50. 


0 


74 


16 


Q8XGR2 


Q8xgr2 salmonella 


940 


4 


50. 


0 


74 


16 


Q82RD7 


Q82rd7 streptomyce 


941 


4 


50. 


0 


74 


16 


Q83RL2 


Q83rl2 shigella fl 


942 


4 


50 . 


0 


75 


2 


Q9F4H3 


Q9f4h3 yersinia en 


943 


4 


50 . 


0 


75 


2 


Q48678 


Q48678 lactococcus 


944 


4 


50. 


0 


75 


6 


P79246 


P79246 sus scrofa 


945 


4 


50. 


0 


75 


6 


Q9GMH0 


Q9gmh0 macaca mula 


946 


4 


50. 


0 


75 


9 


Q38648 


Q38648 bacteriopha 


947 


4 


50 . 


0 


75 


16 


Q9PND7 


Q9pnd7 campylobact 


948 


4 


50 . 


0 


75 


16 


Q98PE6 


Q98pe6 mycoplasma 


949 


4 


50. 


0 


75 


16 


Q8XVB9 


Q8xvb9 ralstonia s 


950 


4 


50. 


0 


75 


16 


Q8EAN1 


Q8eanl shewanella 


951 


4 


50. 


0 


75 


16 


Q87UB3 


Q87ub3 pseudomonas 


952 


4 


50. 


0 


75 


17 


Q8Q085 


Q8q085 methanosarc 


953 


4 


50. 


0 


76 


2 


Q9AH64 


Q9ah64 neisseria g 


954 


4 


50. 


0 


76 


5 


002324 


002 324 caenorhabdi 


955 


4 


50. 


0 


76 


9 


Q38628 


Q38628 bacteriopha 


956 


4 


50. 


0 


76 


9 


Q38626 


Q38626 bacteriopha 


957 


4 


50. 


0 


76 


11 


Q8K3X8 


Q8k3x8 rattus norv 


958 


4 


50 . 


0 


76 


11 


Q91Y43 


Q91y43 rattus norv 


959 


4 


50 . 


0 


76 


12 


Q9Q8W0 


Q9q8w0 shope fibro 


960 


4 


50. 


0 


77 


2 


Q9Z3X0 


Q9z3x0 sphingomona 


961 


4 


50. 


0 


77 


11 


Q99M09 


Q9 9m0 9 mus musculu 


962 


4 


50 . 


0 


77 


16 


Q8D8U0 


Q8d8u0 vibrio vuln 


963 


4 


50 . 


0 


77 


16 


Q87QH5 


Q87qh5 vibrio para 


964 


4 


50. 


0 


77 


17 


026981 


026981 methanobact 


965 


4 


50. 


0 


78 


5 


Q9NFF6 


Q9nff6 plasmodium 


966 


4 


50 . 


0 


78 


16 


Q8VJD0 


Q8vjd0 mycobacteri 


967 


4 


50. 


0 


78 


17 


Q8TIK3 


Q8tik3 methanosarc 


968 


4 


50. 


0 


79 


11 


035463 


035463 cricetulus 


969 


4 


50. 


0 


79 


12 


Q84524 


Q84524 paramecium 



970 


4 


50 . 


0 


80 


2 


085975 


08597 5 sphingomona 


971 


4 


50 . 


0 


80 


5 


Q8T640 


Q8t640 apis mellif 


972 


4 


50 . 


0 


80 


5 


Q8I5N0 


Q8i5n0 Plasmodium 


973 


4 


50 . 


0 


80 


10 


Q8L7F0 


Q817f0 arabidopsis 


974 


4 


50. 


0 


80 


16 


Q99SP0 


Q99sp0 staphylococ 


975 


4 


50 . 


0 


80 


16 


Q92KC9 


Q92kc9 rhizobium m 


976 


4 


50 . 


0 


80 


16 


Q82VB4 


Q82vb4 nitrosomona 


977 


4 


50. 


0 


81 


2 


Q8 3WU3 


Q83wu3 thiocapsa r 


978 


4 


50. 


0 


81 


2 


Q9LA36 


Q91a36 shigella dy 


979 


4 


50. 


0 


81 


2 


Q9EU23 


Q9eu23 escherichia 


980 


4 


50. 


0 


81 


4 


Q9P1F3 


Q9plf3 homo sapien 


981 


4 


50. 


0 


81 


9 


Q8HAI0 


Q8hai0 salmonella 


982 


4 


50. 


0 


81 


9 


Q852W3 


Q8 52w3 mycobacteri 


983 


4 


50. 


0 


81 


16 


Q7U5U9 


Q7u5u9 synechococc 


984 


4 


50 . 


0 


82 


4 


Q16020 


Q16020 homo sapien 


985 


4 


50 . 


0 


82 


4 


Q16014 


Q16014 homo sapien 


986 


4 


50 . 


0 


82 


4 


Q16019 


Q16019 homo sapien 


987 


4 


50. 


0 


82 


7 


Q9XRU0 


Q9xru0 rattus f us c 


988 


4 


50. 


0 


82 


7 


Q9XRT7 


Q9xrt7 rattus fuse 


989 


4 


50 . 


0 


82 


7 


Q30582 


Q30582 macaca fasc 


990 


4 


50 . 


0 


82 


7 


Q30695 


Q3 0 695 macaca mula 


991 


4 


50 . 


0 


82 


7 


Q30884 


Q30884 papio hamad 


992 


4 


50 . 


0 


82 


7 


Q30581 


Q30581 macaca fasc 


993 


4 


50. 


0 


82 


9 


Q7Y3B9 


Q7y3b9 stxl conver 


994 


4 


50. 


0 


82 


9 


Q7Y2V8 


Q7y2v8 stx2 conver 


995 


4 


50 . 


0 


82 


15 


090069 


09 0 0 69 human immun 


996 


4 


50 . 


0 


82 


16 


Q8UDZ3 


Q8udz3 agrobacteri 


997 


4 


50 . 


0 


82 


16 


Q8ZEE2 


Q8zee2 yersinia pe 


998 


4 


50 . 


0 


82 


16 


Q87P24 


Q87p24 vibrio para 


999 


4 


50 . 


0 


82 


16 


Q7VQG0 


Q7vqg0 candidatus 


1000 


4 


50. 


0 


82 


16 


Q7V5R5 


Q7v5r5 prochloroco 



ALIGNMENTS 



RESULT 1 
Q8P193 

ID Q8P193 PRELIMINARY; PRT; 54 AA. 

AC Q8P193; 

DT 01-OCT-2002 (TrEMBLrel. 22, Created) 

DT 01-OCT-2002 (TrEMBLrel. 22, Last sequence update) 

DT 01-OCT-2002 (TrEMBLrel. 22, Last annotation update) 

DE Hypothetical protein spyM18_1000. 

GN SPYM18_1000. 

OS Streptococcus pyogenes (serotype M18). 

OC Bacteria; Firmicutes; Lactobacillales ; Streptococcaceae; 

OC Streptococcus . 

OX NCBI__TaxID=18 6103; 

RN [1] 

RP SEQUENCE FROM N.A. 

RC STRAIN=MGAS82 32 / Serotype M18; 

RX MEDLINE-2 1927 593; PubMed=11917 10 8 ; 

RA Smoot J.C., Barbian K.D., Van Gompel J. J., Smoot L.M., Chaussee M. S 

RA Sylva G.L., Sturdevant D.E., Ricklefs S.M., Porcella S.F., 

RA Parkins L.D., Beres S.B., Campbell D.S., Smith T.M., Zhang Q. , 

RA Kapur V., Daly J. A., Veasy L.G., Musser J.M.; 



RT "Genome sequence and comparative microarray analysis of serotype M18 

RT group A Streptococcus strains associated with acute rheumatic fever 

RT outbreaks . " ; 

RL Proc. Natl. Acad. Sci . U.S.A. 99:4668-4 673(2 002). 

DR EMBL; AE010028; AAL97638.1; -. 

KW Hypothetical protein; Complete proteome . 

SQ SEQUENCE 54 AA; 6370 MW; 5ED4 FECC614 9C9 8E CRC64; 

Query Match 75.0%; Score 6; DB 16; Length 54; 

Best Local Similarity 100.0%; Pred. No. 15; 

Matches 6; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 

Qy 1 EVNLDA 6 

I I I I I I 

Db 18 EVNLDA 2 3 



RESULT 2 




Q89LY8 




ID 


Q89LY8 PRELIMINARY; PRT ; 91 AA. 




AC 


Q89LY8; 




DT 


01-JUN-2003 (TrEMBLrel. 24, Created) 




DT 


01-JUN-2003 (TrEMBLrel. 24, Last sequence update) 




DT 


01-JUN-2003 (TrEMBLrel. 24, Last annotation update) 




DE 


Bsl4405 protein. 




GN 


BSL4405. 




OS 


Bradyrhi zobium j aponicum. 




OC 


Bacteria; Proteobacteria; Alphaproteobacteria ; Rhizobiales; 


OC 


Bradyrhi z obi ace a e ; Bradyrhi zobium. 




ox 


NCBI TaxID=375; 




RN 


[1] 




RP 


SEQUENCE FROM N.A. 




RC 


STRAIN=USDA 110; 




RX 


MEDLINE=22 4 84998; PubMed=12597275 ; 




RA 


Kaneko T., Nakamura Y., Sato S., Minamisawa K. , Uchiumi 


T 


RA 


Sasamoto S., Watanabe A., Idesawa K. , Iriguchi M. , Kawashima K., 


RA 


Kohara M. , Matsumoto M. , Shimpo S., Tsuruoka H., Wada T 


. , Yamada M. , 


RA 


Tabata S . ; 




RT 


"Complete genomic sequence of nitrogen- fixing symbiotic 


bacterium 


RT 


Bradyrhi zobium j aponicum USDA110." ; 




RL 


DNA Res. 9:189-197(2002) . 




DR 


EMBL; AP005951; BAC49670.1; 




KW 


Complete proteome . 




SQ 


SEQUENCE 91 AA; 10366 MW; BD512A98AC3F55AC CRC64; 





Query Match 75.0%; Score 6; DB 16; Length 91; 

Best Local Similarity 100.0%; Pred. No. 23; 

Matches 6; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 

Qy 3 NLDAEF 8 

I I 1 I I I 

Db 2 8 NLDAEF 33 



RESULT 3 
Q80PA8 

ID Q80PA8 PRELIMINARY; PRT; 101 AA. 



AC Q8 0PA8; 

DT 01-JUN-2003 (TrEMBLrel. 24, Created) 

DT 01-JUN-2003 (TrEMBLrel. 24, Last sequence update) 

DT 01-JUN-2003 (TrEMBLrel. 24, Last annotation update) 

DE Orf6 (Fragment). 

OS Spodoptera litura nucleopolyhedrovirus . 

OC Viruses; dsDNA viruses, no RNA stage; Baculoviridae ; 

OC Nucleopolyhedrovirus . 

OX NCBI_TaxID=46242 ; 

RN [1] 

RP SEQUENCE FROM N.A. 

RA Jain M. , Das R.H.; 

RT "Molecular characterization of the structural glycoprotein gp41 gene 

RT of Spodoptera litura nucleopolyhedrosis virus."; 

RL Submitted (DEC-2002) to the EMBL/GenBank/DDB J databases. 

RN [2] 

RP SEQUENCE FROM N.A. 

RA Jain M. , Das R.H.; 

RT "Nucleotide sequence of a 4053bp fragment of the SpltNPV genome and 

RT comparison with the AcNPV genome."; 

RL Submitted (DEC-2002) to the EMBL/GenBank/DDB J databases. 

DR EMBL; AF445192; AA043572.1; 

FT NON_TER 101 101 

SQ SEQUENCE 101 AA; 12174 MW; OD9DE8 36AA4 CAFES CRC64 ; 

Query Match 75.0%; Score 6; DB 12; Length 101; 

Best Local Similarity 100.0%; Pred. No. 26; 

Matches 6; Conservative 0; Mismatches 0; Indels 0; Gaps 

Qy 3 NLDAEF 8 

I I 1 E E I 

Db 9 6 NLDAEF 101 



RESULT 4 




Q87ZB6 




ID 


Q87ZB6 PRELIMINARY; PRT; 173 AA. 




AC 


Q87ZB6; 




DT 


01-JUN-2003 (TrEMBLrel. 24, Created) 




DT 


01-JUN-2003 (TrEMBLrel. 24, Last sequence update) 




DT 


01-JUN-2003 (TrEMBLrel. 24, Last annotation update) 




DE 


Conserved hypothetical protein. 




GN 


PSPT03513 . 




OS 


Pseudomonas syringae (pv. tomato) . 




OC 


Bacteria; Proteobacteria ; Gammaproteobacteria ; Pseudomonadales ; 




OC 


Pseudomonadaceae ; Pseudomonas. 




ox 


NCBI TaxID=323; 




RN 


[1] 




RP 


SEQUENCE FROM N.A. 




RC 


STRAIN=DC3000; 




RA 


Buell R. , Joardar V., Khouri H., Fedorova N . , Tran B., Russell D. 


r 


RA 


Berry K. , Utterback T., Van Aken S., Feldblyum T., Gwinn M. , 




RA 


Dodson R. , DeBoy R. , Durkin A., Kolonay J., Madupu R. , Daugherty 


S. , 


RA 


Brinkac L., Beanan M. , Haft D., Selengut J., Nelson W., Davidsen 


T. , 


RA 


White O., Fraser C, Collmer A.; 




RT 


"Complete sequence of Pseudomonas syringae. 




RL 


Submitted (MAR-2003) to the EMBL/GenBank/DDB J databases. 





DR EMBL; AE016868; AA056988.1; -. 

DR TIGR; PSPT03513; -. 

KW Hypothetical protein; Complete proteome. 

SQ SEQUENCE 173 AA; 19395 MW; 6FB8ACC1 13B365C2 CRC64; 

Query Match 75. 0%; Score 6; DB 16; Length 173; 

Best Local Similarity 100.0%; Pred. No. 41; 

Matches 6; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 

Qy 2 VNLDAE 7 

! I I I I I 

Db 137 VNLDAE 142 



RESULT 5 
Q82WI7 

ID Q82WI7 PRELIMINARY; PRT; 221 AA. 

AC Q82WI7; 

DT 01-JUN-2003 (TrEMBLrel. 24, Created) 

DT 01-JUN-2003 (TrEMBLrel. 24, Last sequence update) 

DT 01-OCT-2003 (TrEMBLrel. 25, Last annotation update) 

DE LeuD; 3-isopropylmalate dehydratase small subunit (EC 4.2.1.33) . 

GN LEUD OR NE0687. 

OS Nitrosomonas europaea. 

OC Bacteria; Proteobacteria ; Betaproteobacteria ; Nitrosomonadales ; 

OC Nitrosomonadaceae ; Nitrosomonas. 

OX NCBI_TaxID=915; 

RN [1] 

RP SEQUENCE FROM N.A. 

RC STRAIN=ATCC 19718 / IFO 14298; 

RX MEDLINE=2258 6410; PubMed=127 00255 ; 

RA Chain P., Lamerdin J.E., Larimer F.W., Regala W. , Lao V., Land M. , 

RA Hauser L., Hooper A.B., Klotz M.G., Norton J., Sayavedra-Soto L.A. , 

RA Arciero D.M., Hommes N.G., Whittaker M.M. , Arp D.J.; 

RT "Complete genome sequence of the ammonia-oxidizing bacterium and 

RT obligate chemolithoautotroph Nitrosomonas europaea."; 

RL J. Bacterid. 185:2759-2773(2003). 

DR EMBL; BX321858; CAD84598.1; 

DR GO; GO: 0009316; C : 3-isopropylmalate dehydratase complex; IEA. 

DR GO; GO: 0003861; F : 3-isopropylmalate dehydratase activity; IEA. 

DR GO; GO:0016829; F:lyase activity; IEA. 

DR GO; GO:0009098; P:leucine biosynthesis; IEA. 

DR GO; GO: 0008152; P : metabolism; IEA. 

DR InterPro; IPR000573; Aconitase_C. 

DR InterPro; IPR004431; LeuD. 

DR Pfam; PF00694; Aconitase_C; 1. 

DR TIGRFAMs; TIGR00171; leuD; 1. 

KW Lyase; Complete proteome. 

SQ SEQUENCE 221 AA; 25456 MW; 63E2A2 1073AE8C8 8 CRC64; 

Query Match 75.0%; Score 6; DB 16; Length 221; 

Best Local Similarity 100.0%; Pred. No. 51; 

Matches 6; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 

Qy 1 EVNLDA 6 

MINI 

Db 151 EVNLDA 156 



RESULT 6 
Q8A4M0 

ID Q8A4M0 PRELIMINARY; PRT; 2 81 AA. 

AC Q8A4M0; 

DT 01-JUN-2003 (TrEMBLrel. 24, Created) 

DT 01-JUN-2003 (TrEMBLrel. 24, Last sequence update) 

DT 01-OCT-2003 (TrEMBLrel. 25, Last annotation update) 

DE Conserved hypothetical protein. 

GN BT2579. 

OS Bacteroides thetaiotaomicron . 

OC Bacteria; Bacteroidetes ; Bacteroides (class); Bacteroidales ; 

OC Bacteroidaceae; Bacteroides . 

OX NCBI_TaxID=818 ; 

RN [1] 

RP SEQUENCE FROM N.A. 

RC STRAIN=VPI-5482 / ATCC 29148; 

RX MEDLINE=22 550 858 ; PubMed=12 66392 8 ; 

RA Xu J., Bjursell M.K., Himrod J., Deng S., Carmichael L.K., 

RA Chiang H.C., Hooper L.V., Gordon J.I.; 

RT "A genomic view of the human-Bacteroides thetaiotaomicron symbiosis."; 

RL Science 2 99:2074-2076(2003). 

DR EMBL; AE016936; AA077686.1; -. 

DR GO; GO: 0005634; Crnucleus; IEA. 

DR GO; GO:0003677; F : DNA binding ; IEA. 

DR GO; GO: 0016233; P: telomere capping; IEA. 

DR InterPro; IPR003415; Telo_bind_alpha . 

DR ProDom; PD018295; Telo_bind_alpha ; 1. 

fCW Hypothetical protein; Complete proteome. 

SQ SEQUENCE 281 AA; 33830 MW; 4 00ADEE36AC9563F CRC64 ; 

Query Match 75.0%; Score 6; DB 16; Length 281; 

Best Local Similarity 100.0%; Pred. No. 63; 

Matches 6; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 

Qy 2 VNLDAE 7 

I I i I I I 

Db 2 64 VNLDAE 2 69 



RESULT 7 
Q93ET9 

ID Q93ET9 PRELIMINARY; PRT; 299 AA. 

AC Q93ET9; 

DT 01-DEC-2001 (TrEMBLrel. 19, Created) 

DT 01-DEC-2001 (TrEMBLrel. 19, Last sequence update) 

DT 01-OCT-2003 (TrEMBLrel. 25, Last annotation update) 

DE Aminoglycoside phophotransf erase . 

GN APH(2')-IB. 

OS Escherichia coli. 

OC Bacteria; Proteobacteria; Gammaproteobacteria; Enterobacteriales ; 

OC Enterobacteriaceae; Escherichia. 

OX NCBI_TaxID=562 ; 

RN [1] 

RP SEQUENCE FROM N.A. 

RX MEDLINE=214 41138; PubMed=11557 45 6; 



RA Chow J.W., Kak V., You I . , Kao S.J., Petrin J., Clewell D.B., 

RA Lerner S.A., Miller G.H., Shaw K.J.; 

RT "Aminoglycoside Resistance Genes aph(2')-Ib and aac(6 f )-Im Detected 

RT Together in Strains of both Escherichia coli and Enterococcus 

RT faecium."; 

RL Antimicrob. Agents Chemother. 45:2691-2694(2001). 

DR EMBL; AF337947; AAK63040.1; 

DR GO; GO: 0005524; F : ATP binding; IEA. 

DR GO; GO: 0004713; F : protein- tyrosine kinase activity; IEA. 

DR GO; GO: 0016740; F : trans f erase activity; IEA. 

DR GO; GO: 0006468; Prprotein amino acid phosphorylation; IEA. 

DR InterPro; IPR002575; APH . 

DR InterPro; IPR008266; Tyr_pkinase_AS . 

DR Pfam; PF01636; APH; 1. 

DR PROSITE; PS00109; PROT E I N_KINAS E_T YR ; 1. 

KW Trans f erase . 

SQ SEQUENCE 299 AA; 35350 MW; 6A69D4F6E6901C81 CRC64; 

Query Match 75.0%; Score 6; DB 2; Length 299; 

Best Local Similarity 100.0%; Pred. No. 66; 

Matches 6; Conservative 0; Mismatches 0; Indels 0; Gaps 

Qy 2 VNLDAE 7 

I I I I I 

Db 2 VNLDAE 7 



RESULT 8 






Q9EVD7 






ID 


Q9EVD7 PRELIMINARY; PRT; 2 99 AA. 






AC 


Q9EVD7 ; 






DT 


01-MAR-2001 { TrEMBLrel . 16, Created) 






DT 


01-MAR-2001 (TrEMBLrel. 16, Last sequence update) 






DT 


01-OCT-2003 (TrEMBLrel. 25, Last annotation update) 






DE 


Aminoglycoside phosphotransferase . 






GN 


APH (2 1 ) -IB. 






OS 


Enterococcus faecium (Streptococcus faecium) . 






OC 


Bacteria; Firmicutes; Lactobacillales ; Enterococcaceae; 


Enterococcus . 


ox 


NCBI TaxID=1352; 






RN 


[1] 






RP 


SEQUENCE FROM N . A. 






RC 


STRAIN=SF11770; 






RX 


MEDLINE=20448770; PubMed=l 09 9187 8 ; 






RA 


Kao S.J., You I., Clewell D.B., Donabedian S.M., Zervos 


M.J. 




RA 


Petrin J., Shaw K.J., Chow J.W.; 






RT 


"Detection of the high-level aminoglycoside resistance 


gene 


aph (2 ' ) -I 


RT 


in Enterococcus faecium."; 






RL 


Antimicrob. Agents Chemother. 44:2876-2879(2000). 






DR 


EMBL; AF207840; AAG13458.1; 






DR 


GO; GO: 0005524; F : ATP binding; IEA. 






DR 


GO; GO: 0004713; F : protein- tyrosine kinase activity; IEA 


* 




DR 


GO; GO: 0016740; F : trans f erase activity; IEA. 






DR 


GO; GO: 0006468; P:protein amino acid phosphorylation; IEA. 




DR 


InterPro; IPR002575; APH. 






DR 


InterPro; IPR008266; Tyr_pkinase AS. 






DR 


Pfam; PF01636; APH; 1. 






DR 


PROSITE; PS00109; PROTEIN KINASE TYR; 1. 







KW Transferase. 

SQ SEQUENCE 299 AA; 35353 MW; 8B99DB5020FDAE23 CRC64; 



Query Match 75.0%; Score 6; DB 2; Length 2 99; 

Best Local Similarity 100.0%; Pred. No. 66; 

Matches 6; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 

Qy 2 VNLDAE 7 

MINI 

Db 2 VNLDAE 7 



RESULT 9 
Q933C0 



ID 
AC 
DT 
DT 
DT 
DE 
GN 
OS 
OC 
OC 
OX 
RN 
RP 
RC 
RX 
RA 
RA 
RA 
RT 
RT 
RL 
RN 
RP 
RC 
RA 
RL 
RN 
RP 
RC 
RX 
RA 
RT 
RT 
RL 
CC 
DR 
DR 



304 AA, 



19, Created) 

19, Last sequence update) 
25, Last annotation update) 



C., Manyani H., Lopez-Lara I., 
Quinto C., Gil-Serrano A., 



Q933C0 PRELIMINARY; PRT; 

Q933C0; 

01-DEC-2 0 01 (TrEMBLrel , 
01-DEC-2001 (TrEMBLrel 
01-OCT-2 0 03 (TrEMBLrel . 
Nodi protein. 
NODI. 

Rhizobium tropici . 

Bacteria; Proteobacteria ; Alphaproteobacteria ; Rhizobiales; 
Rhizobiaceae ; Rhizobium/ Agrobacterium group; Rhizobium. 
NCBI_TaxID=398; 
[1] 

SEQUENCE FROM N . A. 
STRAIN=CIAT8 99; 

MEDLINE=97 0 0274 6; PubMed=8 8500 86 ; 
Folch-Mallol J.L., Marroqui S., Sousa 
van der Drift K.M.G.M., Haverkamp J., 
Thomas-Oates J., Spaink H.P., Megias M. ; 

"Characterization of Rhizobium tropici CIAT899 nodulation factors: The 
role of nodH and nodPQ genes in their sulfation."; 
Mol. Plant Microbe Interact. 9:151-163(1996). 
[2] 

SEQUENCE FROM N . A. 
STRAIN=CIAT8 99; 

Manyani H., Sousa C, Soria-Diaz M.E., Gil-Serrano A., Megias M. ; 

Submitted (MAY-2001) to the EMBL/ GenBank/DDBJ databases. 

[3] 

SEQUENCE FROM N.A. 
STRAIN=CIAT8 99; 

MEDLINE=2 13 60362 ; PubMed=114 67733 ; 
Manyani H., Sousa C, Soria Diaz M.E 
"Regulation of nod factor sulphation 
CIAT899. "; 

Can. J. Microbiol. 47:574-579(2001). 



, Gil-Serrano A. , Megias M. ; 
genes in Rhizobium tropici 



-!- SIMILARITY 
EMBL; X87608; 
EMBL; AY029295 



BELONGS TO THE ABC 
CAC39128.1; 

AAL02416. 1; -. 



TRANSPORTER FAMILY. 



DR 


GO; 


GO: 


0016020; 


C: 


membrane; IEA. 


DR 


GO; 


GO: 


0005524; 


F: 


ATP binding; IEA. 


DR 


GO; 


GO: 


0004009; 


F: 


ATP-binding cassette (ABC) 


DR 


GO; 


GO : 


0000166; 


F: 


nucleotide binding; IEA. 


DR 


GO; 


GO: 


0005215; 


F: 


transporter activity; IEA. 


DR 


GO; 


GO: 


0009877; 


P: 


nodulation; IEA. 



transporter acti . 



IEA. 



DR GO; GO: 0006810; P: transport; IEA. 

DR InterPro; IPR003593; AAA_AT P a s e . 

DR InterPro; IPR003439; ABC_transporter . 

DR InterPro; IPR005978; ABC_transptNodI . 

DR Pfam; PF00005; ABC_tran; 1. 

DR ProDom; PD000006; ABC_transporter ; 1. 

DR SMART; SM0 0 382; AAA; 1. 

DR TIGRFAMs; TIGR01288; nodi; 1. 

DR PROSITE; PS00211; ABC_T RAN S PORT ER_1 ; 1. 

DR PROSITE; PS508 93; ABC_T RAN S PORT ER_2 ; 1. 

KW ATP-binding; Transport. 

SQ SEQUENCE 304 AA; 33635 MW; A9A3A4 62DCEE5E92 CRC64; 

Query Match 75.0%; Score 6; DB 2; Length 304; 

Best Local Similarity 100.0%; Pred. No. 67; 

Matches 6; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 

Qy 3 NLDAEF 8 

II I I I I 

Db 8 8 NLDAEF 93 



RESULT 10 
Q8H0B0 

ID Q8H0B0 PRELIMINARY; PRT; 308 AA. 

AC Q8H0B0; 

DT 01-MAR-2003 (TrEMBLrel . 23, Created) 

DT 01-MAR-2003 (TrEMBLrel. 23, Last sequence update) 

DT 01-OCT-2003 (TrEMBLrel. 25, Last annotation update) 

DE Putative gibberellin 20-oxidase. 

GN OSJNBA0096G08 . 6. 

OS Oryza sativa (japonica cultivar-group ) . 

OC Eukaryota; Viridiplantae ; Streptophyta ; Embryophyta; Tracheophyta ; 

OC Spermatophyta; Magnoliophyta ; Liliopsida; Poales; Poaceae; 

OC Ehrhartoideae ; Oryzeae; Oryza. 

OX NCBI_TaxID=39947 ; 

RN [1] 

RP SEQUENCE FROM N . A. 

RC STRAIN=cv. Nipponbare; 

RA Buell C.R., Yuan Q. r Ouyang S . , Liu J., Gansberger K., Kim M.M., 

RA Overton II L.L., Bera J. J., Tsitrin T., Krol M.I., Jarrahi B.B., 

RA Jin S . S . , Koo H., Zismann V. , Hsiao J., Blunt S., Vanaken S.S., 

RA Utterback T.T., Feldblyum T.V., Yang Q.Q., Haas B.J., Suh B.B., 

RA Peterson J. J., Quackenbush J., White O., Salzberg S.L., Fraser CM.; 

RT "Oryza sativa chromosome 10 BAC OS JNBaOO 96G08 genomic sequence."; 

RL Submitted (DEC-2002) to the EMBL/ GenBank/DDBJ databases. 

DR EMBL; AC078894; AAO00709.1; 

DR GO; GO: 0005506; Friron ion binding; IEA. 

DR GO; GO: 0016216; F : isopenicillin-N synthase activity; IEA. 

DR GO; GO: 0017000; P:antibiotic biosynthesis; IEA. 

DR InterPro; IPR005123; 20G-FeII_Oxy . 

DR InterPro; IPR002283; IPN_synth. 

DR Pfam; PF03171; 20G-FeII_Oxy ; 1. 

DR PRINTS; PR00682; IPNSYNTHASE. 

SQ SEQUENCE 308 AA; 34541 MW; AC6F1 6A95D7 8D905 C RC 64; 



Query Match 



75.0%; Score 6; DB 10; Length 308; 



Best Local Similarity 100.0%; Pred. No. 68; 

Matches 6; Conservative 0; Mismatches 0; Indels 0; Gap 



Qy 3 NLDAEF 8 

MINI 

Db 155 NLDAEF 160 



RESULT 11 
Q7V5K6 

ID Q7V5K6 PRELIMINARY; PRT; 319 AA. 

AC Q7V5K6; 

DT 01-OCT-2003 (TrEMBLrel. 25, Created) 

DT 01-OCT-2003 (TrEMBLrel. 25, Last sequence update) 

DT 01-OCT-2003 (TrEMBLrel . 25, Last annotation update) 

DE DnaJ3 protein. 

GN DNAJ3 OR PMT1548. 

OS Prochlorococcus marinus (strain MIT 9313). 

OC Bacteria; Cyanobacteria ; Prochlorophytes ; Prochlorococcaceae; 

OC Prochlorococcus . 

OX NCBI_TaxID=74547 ; 

RN [1] 

RP SEQUENCE FROM N.A. 

RX MEDLINE=22 82569 8 ; PubMed=12 917 642 ; 

RA Rocap G. f Larimer F . W . , Lamerdin J., Malfatti S., Chain P., 

RA Ahlgren N.A. , Arellano A., Coleman M. , Hauser L., Hess W.R., 

RA Johnson Z.I., Land M. , Lindell D . , Post A.F., Regala W., Shah M. , 

RA Shaw S.L., Steglich C, Sullivan M.B., Ting C.S., Tolonen A., 

RA Webb E.A., Zinser E.R., Chisholm S.W.; 

RT "Genome divergence in two Prochlorococcus ecotypes reflects oceanic 

RT niche differentiation."; 

RL Nature 424:1042-1047(2003). 

DR EMBL; BX572099; CAE21723.1; 

KW Complete proteome. 

SQ SEQUENCE 319 AA; 34597 MW; 8 02 6B7257 7 603912 CRC64; 

Query Match 75.0%; Score 6; DB 16; Length 319; 

Best Local Similarity 100.0%; Pred. No. 70; 

Matches 6; Conservative 0; Mismatches 0; Indels 0; Gap 

Qy 2 VNLDAE 7 

Mill! 

Db 141 VNLDAE 146 



RESULT 12 
Q88P95 

ID Q88P95 PRELIMINARY; PRT; 324 AA. 

AC Q88P95; 

DT 01-JUN-2003 (TrEMBLrel. 24, Created) 

DT 01-JUN-2003 (TrEMBLrel. 24, Last sequence update) 

DT 01-OCT-2003 (TrEMBLrel. 25, Last annotation update) 

DE KpsF/GutQ family protein. 

GN PP0957. 

OS Pseudomonas putida (strain KT2440) . 

OC Bacteria; Proteobacteria ; Gammaproteobacteria ; Pseudomonadales ; 
OC Pseudomonadaceae ; Pseudomonas. 



OX NCBI_TaxID=160488 ; 

RN [1] 

RP SEQUENCE FROM N.A. 

RX MEDLINE=2 2 423 0 60; PubMed=l 2 534 4 63; 

RA Nelson K.E., Weinel C, Paulsen I.T., Dodson R.J., Hilbert 

RA Martins dos Santos V.A.P., Fouts D.E., Gill S.R., Pop M. , Holmes M. , 

RA Brinkac L . , Beanan M. , DeBoy R.T., Daugherty S., Kolonay J . , 

RA Madupu R. , Nelson W. , White O . , Peterson J., Khouri H . , Hance I., 

RA Chris Lee P . , Holtzapple E., Scanlan D., Tran K. r Moazzez A., 

RA Utterback T . , Rizzo M. , Lee K. , Kosack D . r Moestl D., Wedler H., 

RA Lauber J., Stjepandic D . , Hoheisel J., Straetz M. , Heim S . , 

RA Kiewitz C, Eisen J., Timmis K.N., Duesterhoeft A., Tuemmler B., 

RA Fraser CM. ; 

RT "Complete genome sequence and comparative analysis of the 

RT metabolically versatile Pseudomonas putida KT2440." ; 

RL Environ. Microbiol. 4:799-808(2002). 

DR EMBL; AE016777; AAN66582.1; -. 

DR TIGR; PP0957; -. 

DR GO; GO: 0005529; F: sugar binding; IEA. 

DR GO; GO: 0005975; P : carbohydrate metabolism; IEA. 

DR InterPro; IPR000644; CBS_domain. 

DR InterPro; IPR001347; SIS. 

DR Pfam; PF00571; CBS; 2. 

DR Pfam; PF01380; SIS; 1. 

KW Complete proteome . 

SQ SEQUENCE 324 AA; 34397 MW; 67CEF2C1606E962F CRC64; 



Query Match 75.0%; Score 6; DB 16; Length 324; 

Best Local Similarity 100.0%; Pred. No. 71; 

Matches 6; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 

Qy 1 EVNLDA 6 

I I i I I I 

Db 137 EVNLDA 142 



RESULT 13 
Q7U4X9 

ID Q7U4X9 PRELIMINARY; PRT; 324 AA. 

AC Q7U4X9; 

DT 01-OCT-2003 (TrEMBLrel. 25, Created) 

DT 01-OCT-2003 (TrEMBLrel. 25, Last sequence update) 

DT 01-OCT-2003 (TrEMBLrel. 25, Last annotation update) 

DE DnaJ3 protein. 

GN DNAJ3 OR SYNW1934 . 

OS Synechococcus sp . (strain WH8102) . 

OC Bacteria; Cyanobacteria; Chroococcales ; Synechococcus. 

OX NCBI__TaxID=84588 ; 

RN [1] 

RP SEQUENCE FROM N.A. 

RX MEDLINE=2 2 825697; PubMed=12 917641; 

RA Palenik B., Brahamsha B., Larimer F.W., Land M., Hauser L . , Chain P. 

RA Lamerdin J., Regala W., Allen E.E., McCarren J., Paulsen I., 

RA Dufresne A., Partensky F., Webb E.A., Waterbury J.; 

RT "The genome of a motile marine Synechococcus. "; 

RL Nature 424:1037-1042(2 003). 

DR EMBL; BX569694; CAE08449.1; -. 



KW Complete proteome. 

SQ SEQUENCE 324 AA; 34972 MW; 8BD4931073F87B31 CRC64; 



Query Match 75.0%; Score 6; DB 16; Length 324; 

Best Local Similarity 100.0%; Pred. No. 71; 

Matches 6; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 



Qy 


2 VNLDAE 7 








Db 


1 1 E 1 1 1 

146 VNLDAE 151 








RESULT 14 








Q9HVW0 








ID 


Q9HVW0 PRELIMINARY; PRT; 32 6 AA. 








AC 


Q9HVW0; 








DT 


01-MAR-2001 (TrEMBLrel. 16, Created) 








DT 


01-MAR-2001 (TrEMBLrel. 16, Last sequence update) 








DT 


01-OCT-2003 (TrEMBLrel. 25, Last annotation update) 








DE 


Hypothetical protein PA4457. 








GN 


PA4457 . 








OS 


Pseudomonas aeruginosa. 








OC 


Bacteria; Proteobacteria ; Gammaproteobacteria ; Pseudomonadales ; 






OC 


Pseudomonadaceae ; Pseudomonas. 








OX 


NCBI TaxID=287; 








RN 


[1] 








RP 


SEQUENCE FROM N.A. 








RC 


STRAIN=ATCC 15692 / PAOl; 








RX 


MEDLINE-2 0437 337 ; PubMed=10 9 84 04 3 ; 








RA 


Stover C.K., Pham X.-Q.T., Erwin A.L., Mizoguchi S.D., Warrener 


P. 


m 

r 


RA 


Hickey M.J., Brinkman F.S.L., Hufnagle W.O., Kowalik D.J., 


Lagrou 


M . , 

r 


RA 


Garber R.L., Goltry L . , Tolentino E., Westbrock-Wadman S., 


Yuan 


Y. 


f 


RA 


Brody L.L., Coulter S.N., Folger K.R., Kas A., Larbig K. , 


Lim R. 


M. 


/ 


RA 


Smith K.A. , Spencer D.H., Wong G.K.-S., Wu Z., Paulsen I.T 


• r 






RA 


Reizer J., Saier M.H., Hancock R.E.W., Lory S., Olson M.V. 


• 






RT 


"Complete genome sequence of Pseudomonas aeruginosa PAOl, 


an 






RT 


opportunistic pathogen."; 








RL 


Nature 4 06:959-9 64(2 000). 








DR 


EMBL; AE004860; AAG07845.1; -. 








DR 


PIR; A83087; A83087. 








DR 


GO; GO: 0005529; F: sugar binding; IEA. 








DR 


GO; GO: 0005975; P : carbohydrate metabolism; IEA. 








DR 


InterPro; IPR000644; CBS domain. 








DR 


InterPro; IPR004800; KpsF. 








DR 


InterPro; IPR001347; SIS. 








DR 


Pfam; PF00571; CBS; 2. 








DR 


Pfam; PF01380; SIS; 1. 








DR 


SMART; SM00116; CBS; 2. 








DR 


TIGRFAMs ; TIGR00393; kpsF; 1 . 








KW 


Hypothetical protein; Complete proteome. 








SQ 


SEQUENCE 326 AA; 34197 MW; 510B3190C0F92C51 CRC64; 









Query Match 75.0%; Score 6; DB 16; Length 326; 

Best Local Similarity 100.0%; Pred. No. 71; 

Matches 6; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 



Qy 



1 EVNLDA 6 



Db 139 EVNLDA 144 



RESULT 15 
Q9AHQ9 

ID Q9AHQ9 PRELIMINARY; PRT; 352 AA. 

AC Q9AHQ9; 

DT 01-JUN-2001 (TrEMBLrel. 17, Created) 

DT 01-JUN-2001 (TrEMBLrel. 17, Last sequence update) 

DT 01-OCT-2003 (TrEMBLrel. 25, Last annotation update) 

DE NIO-f ormyltetrahydrof olate synthetase (EC 6.3.4.3) (Fragment). 

OS Gamma-proteobacterium Hot 7 5m4 . 

OC Bacteria; environmental samples. 

OX NCBI_TaxID=7 7133; 

RN [1] 

RP SEQUENCE FROM N.A. 

RX MEDLINE=2 114 18 62 ; PubMed=l 122 9 939 ; 

RA Leaphart A.B., Lovell C.R.; 

RT "Recovery and Analysis of Formyltetrahydrof olate Synthetase Gene 

RT Sequences from Natural Populations of Acetogenic Bacteria."; 

RL Appl. Environ. Microbiol. 67:1392-1395(2 001). 



DR EMBL; AF2 95714 

DR GO; GO: 0005524 

DR GO; GO: 0004329 

DR GO; GO: 0016874 

DR GO; GO: 0009396; 



AAK20259.1; 
F : ATP binding; IEA. 

F : formate-tetrahydrof olate ligase activity; IEA. 
F:ligase activity; IEA. 

P: folic acid and derivative biosynthesis; IEA. 
DR InterPro; IPR000559; Fmtethyd_synth . 
DR Pfam; PF01268; FTHFS ; 1. 
DR PROSITE; PS00722; FTHFS_2 ; 1. 
KW Ligase. 

FT NON_TER 1 1 

FT NON_TER 352 352 

SQ SEQUENCE 352 AA; 37942 MW; FC473D2236908EC9 CRC64; 

Query Match 75.0%; Score 6; DB 2; Length 352; 

Best Local Similarity 100.0%; Pred. No. 76; 

Matches 6 ; Conservative 0 ; Mismatches 0 ; Indels 0 ; Gaps 0 ; 

Qy 1 EVNLDA 6 

I I II I I 

Db 215 EVNLDA 22 0 



RESULT 16 
Q8H0A8 

ID Q8H0A8 PRELIMINARY; PRT; 357 AA. 

AC Q8H0A8; 

DT 01-MAR-2003 (TrEMBLrel. 23, Created) 

DT 01-MAR-2003 (TrEMBLrel. 23, Last sequence update) 

DT 01-OCT-2003 (TrEMBLrel. 25, Last annotation update) 

DE Putative gibberellin 20-oxidase. 

GN OSJNBA0096G08 . 8 . 

OS Oryza sativa (japonica cul tivar-group ) . 

OC Eukaryota; Viridiplantae ; Streptophyta ; Embryophyta; Tracheophyta ; 

OC Spermatophyta ; Magnoliophyta ; Liliopsida; Poales ; Poaceae; 

OC Ehrhartoideae ; Oryzeae; Oryza. 



OX NCBI_TaxID=39947; 

RN CI] 

RP SEQUENCE FROM N . A. 

RC STRAIN^cv. Nipponbare; 

RA Buell C.R., Yuan Q. , Ouyang S., Liu J., Gansberger K. , Kim M.M., 

RA Overton II L.L., Bera J. J., Tsitrin T . , Krol M.I., Jarrahi B.B., 

RA Jin S.S., Koo H., Zismann V. , Hsiao J., Blunt S., Vanaken S.S., 

RA Utterback T . T . , Feldblyum T.V., Yang Q.Q., Haas B.J., Suh B.B., 

RA Peterson J. J., Quackenbush J., White O., Salzberg S.L., Fraser CM.; 

RT "Oryza sativa chromosome 10 BAC OS JNBa0096G08 genomic sequence."; 

RL Submitted (DEC-2002) to the EMBL/ GenBank/DDBJ databases. 

RN [2] 

RP SEQUENCE FROM N.A. 

RC STRAIN=cv. Nipponbare; 

RA The Rice Chromosome 10 Sequencing Consortium; 

RT "In-depth view of structure,, activity, and evolution of rice 

RT chromosome 10."; 

RL Science 300:1566-1569(2 003). 

RN [3] 

RP SEQUENCE FROM N.A. 

RC STRAIN=cv. Nipponbare; 

RA Buell C.R., Wing R.A. , McCorabie W.R., Messing J., Yuan Q.; 

RL Submitted (MAY-2003) to the EMBL/ GenBank/DDBJ databases. 

DR EMBL; AC078894; AAO00707.1; -. 

DR EMBL; AE017113; AAP54689.1; -. 

DR GO; GO: 0005506; F:iron ion binding; IEA. 

DR GO; GO: 0016216; F : isopenicillin— N synthase activity; IEA. 

DR GO; GO: 0017000; Prantibiotic biosynthesis; IEA. 

DR InterPro; IPR005123; 20G-FeI I_Oxy . 

DR InterPro; IPR002283; IPN_synth. 

DR Pfam; PF03171; 20G-FeII_Oxy ; 1. 

DR PRINTS; PR00682; I PNSYNTHASE . 

SQ SEQUENCE 357 AA; 39386 MW; 3 1 9BE07F54FA6DC0 CRC64; 

Query Match 75.0%; Score 6; DB 10; Length 357; 

Best Local Similarity 100.0%; Pred. No. 77; 

Matches 6; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 

Qy 3 NLDAEF 8 

I I I I I I 

Db 198 NLDAEF 203 



RESULT 17 
Q8LNJ6 

ID Q8LNJ6 PRELIMINARY; PRT; 381 AA. 

AC Q8LNJ6; 

DT 01-OCT-2002 (TrEMBLrel. 22, Created) 

DT 01-OCT-2002 (TrEMBLrel. 22, Last sequence update) 

DT 01-OCT-2003 (TrEMBLrel. 25, Last annotation update) 

DE Putative gibberellin oxidase. 

GN OSJNBB0 02 8C01 . 33 . 

OS Oryza sativa (japonica cultivar-group ) . 

OC Eukaryota; Viridiplantae; S t reptophyta ; Embryophyta; Tracheophyta ; 

OC Spermatophyta ; Magnoliophyta ; Liliopsida; Poales; Poaceae; 

OC Ehrhartoideae ; Oryzeae; Oryza. 

OX NCBI TaxID=39947; 



RN [1] 

RP SEQUENCE FROM N . A. 

RC STRAIN=cv. Nipponbare; 

RA Buell C.R., Yuan Q. , Ouyang S., Liu J., Gansberger K . , Kim M.M., 

RA Overton II L.L., Bera J. J., Tsitrin T . , Krol M.I., Jarrahi B.B., 

RA Jin S.S., Koo H., Zismann V., Hsiao J . , Blunt S., Vanaken S.S., 

RA Utterback T . T . , Feldblyum T.V., Yang Q.Q., Haas B.J., Suh B.B., 

RA Peterson J. J., Quackenbush J., White 0., Salzberg S.L., Fraser CM.; 

RT "Oryza sativa chromosome 10 BAC OS JNBb002 8C01 genomic sequence."; 

RL Submitted (AUG-2 002) to the EMBL/ GenBank/DDBJ databases. 

RN [2] 

RP SEQUENCE FROM N.A. 

RC STRAIN=cv. Nipponbare; 

RA The Rice Chromosome 10 Sequencing Consortium; 

RT ,f In-depth view of structure, activity, and evolution of rice 

RT chromosome 10."; 

RL Science 300:1566-15 69(2 003). 

RN [3] 

RP SEQUENCE FROM N.A. 

RC STRAIN=cv. Nipponbare; 

RA Buell C.R., Wing R.A. , McCombie W.R., Messing J., Yuan Q.; 

RL Submitted (MAY-2003) to the EMBL/ GenBank/ DDB J databases. 

DR EMBL; AC079029; AAM92286.1; -. 

DR EMBL; AE017113; AAP54688.1; -. 

DR Gramene; Q8LNJ6; -. 

DR GO; GO: 0005506; F:iron ion binding; IEA. 

DR GO; GO: 0016216; F : isopenicillin-N synthase activity; IEA. 

DR GO; GO:0017000; P : antibiotic biosynthesis ; IEA. 

DR InterPro; IPR005123; 20G-FeII_Oxy . 

DR InterPro; IPR002283; IPN_synth. 

DR Pfam; PF03171; 20G-FeII_Oxy ; 1. 

DR PRINTS; PR00682; I PNSYNTHASE . 

SQ SEQUENCE 381 AA; 42401 MW; 7F2 180214 1337B55 CRC64; 

Query Match 7 5.0%; Score 6; DB 10; Length 381; 

Best Local Similarity 100.0%; Pred. No. 82; 

Matches 6; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 

Qy 3 NLDAEF 8 

Mill! 

Db 22 8 NLDAEF 2 33 



RESULT 18 
Q 8 AVI 2 

ID Q8AVI2 PRELIMINARY; PRT; 436 AA. 

AC Q8AVI2; 

DT 01-MAR-2003 { TrEMBLrel . 23, Created) 

DT 01-MAR-2003 (TrEMBLrel. 23, Last sequence update) 

DT 01-OCT-2003 (TrEMBLrel. 25, Last annotation update) 

DE Similar to keratin 18 (Fragment) . 

OS Xenopus laevis (African clawed frog) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleos tomi ; 

OC Amphibia; Batrachia; Anura; Mesobat rachia ; Pipoidea; Pipidae; 

OC Xenopodinae; Xenopus. 

OX NCBI_TaxID=8 355; 

RN [1] 



RP SEQUENCE FROM N.A. 

RC TISSUE=Embryo; 

RA Klein S . , Strausberg R. ; 

RL Submitted (JAN-2003) to the EMBL/GenBank/DDB J databases. 

DR EMBL; BC042269; AAH42269.1; 

DR GO; GO: 0005882; C : intermediate filament; IEA. 

DR InterPro; IPR001664; IF. 

DR InterPro; IPR002957; Keratin_I . 

DR Pfam; PF00038; filament; 1. 

DR PRINTS; PR01248; TYPElKERATIN . 

DR PROSITE; PS00226; IF; 1. 

KW Keratin. 

FT NON_TER 1 1 

SQ SEQUENCE 436 AA; 48322 MW; F34 66A556C5B72 9D CRC64; 



Query Match 75.0%; Score 6; DB 13; Length 436; 

Best Local Similarity 100.0%; Pred. No. 92; 

Matches 6; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 

Qy 2 VNLDAE 7 

I I 1 I I I 

Db 288 VNLDAE 2 93 



RESULT 19 






Q87UP7 






ID 


Q87UP7 PRELIMINARY; PRT; 500 AA. 






AC 


Q87UP7 ; 






DT 


01-JUN-2003 { TrEMBLrel . 24, Created) 






DT 


01-JUN-2003 (TrEMBLrel . 24, Last sequence update) 






DT 


01-OCT-2003 (TrEMBLrel. 25, Last annotation update) 






DE 


Exopolyphosphatase . 






GN 


PPX OR PSPT05249. 






OS 


Pseudomonas syringae (pv. tomato) . 






OC 


Bacteria; Proteobacteria; Gammaproteobacteria ; Pseudomonadales ; 




OC 


Pseudomonadaceae ; Pseudomonas. 






OX 


NCBI TaxID=323; 






RN 


[1] 






RP 


SEQUENCE FROM N.A. 






RC 


STRAIN=DC3000; 






RA 


Buell R. , Joardar V., Khouri H., Fedorova N . , Tran B., 


Russell D. 


r 


RA 


Berry K., Utterback T., Van Aken S., Feldblyum T., Gwinn M. , 




RA 


Dodson R. , DeBoy R. , Durkin A., Kolonay J., Madupu R. , 


Daugherty 


S. , 


RA 


Brinkac L . , Beanan M. , Haft D., Selengut J., Nelson W., 


Davidsen 


T. , 


RA 


White O., Fraser C, Collmer A. ; 






RT 


"Complete sequence of Pseudomonas syringae."; 






RL 


Submitted (MAR-2003) to the EMBL/GenBank/DDB J databases 


• 




DR 


EMBL; AE016875; AA058675.1; 






DR 


TIGR; PSPT05249; - . 






DR 


GO; GO: 0008784; F: alanine racemase activity; IEA. 






DR 


GO; GO: 0006522; Pralanine metabolism; IEA. 






DR 


InterPro; IPR000821; Ala_racemase . 






DR 


InterPro; IPR003695; Ppx GppA. 






DR 


Pfam; PF02541; Ppx-GppA; 1. 






DR 


PROSITE; PS00395; ALAN I N E_RAC EMAS E ; 1. 






KW 


Complete proteome. 






SQ 


SEQUENCE 500 AA; 56248 MW; 1F3123C84 83632DB CRC64; 







Query Match 75.0%; Score 6; DB 16; Length 500; 

Best Local Similarity 100.0%; Pred. No. le+02; 

Matches 6; Conservative 0; Mismatches 0; Indels 0; Gap 

Qy 3 NLDAEF 8 

I I I I I I 

Db 463 NLDAEF 468 



RESULT 2 0 
Q97XQ7 

ID Q97XQ7 PRELIMINARY; PRT; 802 AA. 

AC Q97XQ7; 

DT 01-OCT-2001 (TrEMBLrel. 18, Created) 

DT 01-OCT-2001 (TrEMBLrel. 18, Last sequence update) 

DT 01-OCT-2003 (TrEMBLrel. 25, Last annotation update) 

DE Hypothetical protein SS01653. 

GN SSQ1653. 

OS Sulfolobus solf atari cus . 

OC Archaea; Crenarchaeota; Thermoprotei ; Sulf olobales ; Sulf olobaceae ; 

OC Sulfolobus. 

OX NCBI_TaxID=22 87; 

RN [1] 

RP SEQUENCE FROM N.A. 

RC STRAIN=ATCC 35092 / DSM 1617 / P2; 

RX MEDLINE=2 13322 96; PubMed=l 142 772 6 ; 

RA She Q . , Singh R.K., Confalonieri F. , Zivanovic Y. , Allard G., 

RA Awayez M.J., Chan-Weiher C.C.-Y., Clausen I.G., Curtis B.A., 

RA De Moors A., Erauso G., Fletcher C, Gordon P.M.K., 

RA Heikamp-de Jong I., Jeffries A.C., Kozera C.J., Medina N., Peng X., 

RA Thi-Ngoc H.P., Redder P., Schenk M.E., Theriault C, Tolstrup N., 

RA Charlebois R.L., Doolittle W.F., Duguet M. , Gaasterland T . , 

RA Garrett R.A., Ragan M.A., Sensen C.W., Van der Oost J.; 

RT "The complete genome of the crenarchaeon Sulfolobus solfataricus P2 

RL Proc. Natl. Acad. Sci. U.S.A. 98:7835-7 840(2001). 

DR EMBL; AE006779; AAK41866.1; 

DR PIR; C90326; C90326. 

DR GO; GO: 0005524; F : ATP binding; IEA. 

DR GO; GO: 0008026; F : ATP dependent helicase activity; IEA. 

DR GO; GO:0003677; F : DNA binding; IEA. 

DR InterPro; IPR001410; DEAD. 

DR InterPro; IPR000330; SNF2_N . 

DR Pf am; PF00176; SNF2_N; 1 . 

DR SMART; SM00487; DEXDc; 1. 

KW Hypothetical protein; Complete proteome. 

SQ SEQUENCE 802 AA; 92852 MW; 8 0F8 6655C9F5 04 OA CRC64; 

Query Match 75.0%; Score 6; DB 17; Length 802; 

Best Local Similarity 100.0%; Pred. No. 1 . 6e+02 ; 

Matches 6; Conservative 0; Mismatches 0; Indels 0; Gap 

Qy 1 EVNLDA 6 

I I I I I I 

Db 2 92 EVNLDA 2 97 



RESULT 21 
Q9VI82 

ID Q9VI82 PRELIMINARY; PRT; 822 AA. 

AC Q9VI82; 

DT 01-MAY-2000 (TrEMBLrel . 13, Created) 

DT 01-MAY-2000 (TrEMBLrel. 13, Last sequence update) 

DT 01-OCT-2003 (TrEMBLrel. 25, Last annotation update) 

DE CG1234 protein (LD35257P) . 

GN CG1234. 

OS Drosophila melanogaster (Fruit fly) . 

OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; 

OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; 

OC Ephydroidea; Drosophil idae ; Drosophila. 

OX NCBI_TaxID=7227; 

RN [1] 

RP SEQUENCE FROM N . A. 

RC STRAIN=Berkeley; 

RX MEDLINE=2 019 60 06; PubMed=10731132 ; 

RA Adams M.D., Celniker S.E., Holt R.A. , Evans C.A., Gocayne J.D., 

RA Amanatides P.G., Scherer S.E., Li P.W., Hoskins R.A. , Galle R. F. , 

RA George R.A., Lewis S.E., Richards S., Ashburner M. , Henderson S.N., 

RA Sutton G.G., Wortman J.R., Yandell M.D., Zhang Q . , Chen L.X., 

RA Brandon R.C., Rogers Y.-H.C, Blazej R.G., Champe M. , Pfeiffer B.D., 

RA Wan K.H., Doyle C, Baxter E.G., Helt G., Nelson C.R., Miklos G.L.G. 

RA Abril J.F., Agbayani A., An H.-J., Andrews-Pf annkoch C, Baldwin D., 

RA Ballew R.M., Basu A., Baxendale J., Bayraktaroglu L., Beasley E.M., 

RA Beeson K.Y., Benos P.V., Berman B.P., Bhandari D., Bolshakov S., 

RA Borkova D., Botchan M.R., Bouck J., Brokstein P., Brottier P., 

RA Burtis K.C., Busam D.A., Butler H., Cadieu E . , Center A., Chandra I. 

RA Cherry J.M., Cawley S., Dahlke C, Davenport L.B., Davies P., 

RA de Pablos B., Delcher A., Deng Z., Mays A.D., Dew I., Dietz S.M., 

RA Dodson K., Doup L.E., Downes M. , Dugan-Rocha S., Dunkov B.C., Dunn P 

RA Durbin K.J., Evangelista C.C., Ferraz C, Ferriera S., Fleischmann W 

RA Fosler C, Gabrielian A.E., Garg N.S., Gelbart W.M., Glasser K., 

RA Glodek A., Gong F. , Gorrell J.H., Gu Z . , Guan P., Harris M. , 

RA Harris N.L., Harvey D., Heiman T.J., Hernandez J.R., Houck J., 

RA Hostin D., Houston K.A., Howland T.J., Wei M.-H., Ibegwam C, 

RA Jalali M. , Kalush F . , Karpen G.H., Ke Z . , Kennison J. A., Ketchum K.A 

RA Kimmel B.E., Kodira CD., Kraft C, Kravitz S., Kulp D., Lai Z., 

RA Lasko P., Lei Y. , Levitsky A. A., Li J., Li Z., Liang Y. , Lin X., 

RA Liu X., Mattel B . , Mcintosh T.C., McLeod M.P., McPherson D . , 

RA Merkulov G., Milshina N.V., Mobarry C, Morris J., Moshrefi A., 

RA Mount S.M., Moy M. , Murphy B . , Murphy L . , Muzny D.M., Nelson D.L., 

RA Nelson D.R., Nelson K.A. , Nixon K., Nusskern D.R., Pacleb J.M. , 

RA Palazzolo M. , Pittman G.S., Pan S., Pollard J., Puri V., Reese M.G., 

RA Reinert K., Remington K., Saunders R.D.C., Scheeler F. , Shen H., 

RA Shue B.C., Siden-Kiamos I., Simpson M. , Skupski M.P., Smith T., 

RA Spier E., Spradling A.C., Stapleton M. , Strong R. , Sun E . , 

RA Svirskas R. , Tector C, Turner R. , Venter E., Wang A.H., Wang X., 

RA Wang Z.-Y., Wassarman D.A., Weinstock G.M., Weissenbach J., 

RA Williams S.M., Woodage T., Worley K.C., Wu D., Yang S., Yao Q.A. , 

RA Ye J., Yeh R.-F., Zaveri J.S., Zhan M. , Zhang G. , Zhao Q . , Zheng L., 

RA Zheng X.H., Zhong F.N., Zhong W. , Zhou X., Zhu S., Zhu X., Smith H.O 

RA Gibbs R.A., Myers E.W., Rubin G.M. , Venter J.C.; 

RT "The genome sequence of Drosophila melanogaster."; 

RL Science 2 8 7:2185-2195(2000). 

RN [2] 



RP SEQUENCE FROM N.A. 

RC STRAIN=Berkeley; T 

, w n^v^pin P Honq L . , Agbayanx A. , Carlson J., 
RA Stapleton M. , BroKstem r • * nuny ' y> * _ . _ „ r _ _ rrrp p 

-n^y-ci^t-t v Farfan D., Frise E., George k- , 
RA Champe M. , Chavez C, Dorsett v., cajLi.au v., 

ka ^iiampc , . Tj Li P , Liao G., Miranda A., Mungall C.J., 

s r»Zo"X£"?. ^:,-v., p« k ... ... .» k 

RA Yu C , Lewis S.E., Rubin G.M. , Celniker S.; 
RL Submitted (AUG-2001) to the EMBL/GenBank/DDB J databases. 
DR EMBL; AE003672; AAF54043.1; -. 
EMBL; AY051854; AAK93278.1; -. 



DR 



DR FlyBase; FBgnO037489; CG1234 ^ TH t-w tfa 

DR GO; GO: 0004197; F: cys teine-type endopeptidase aC ^ lty ' IEA ' 

DR GO; GO:0006508; P : proteolysis and peptidolysis; IEA. 

DR InterPro; IPR008938; ARM. 

DR InterPro; IPR005612; CBF. 

DR InterPro; IPR000169; SHprot_acsite . 

DR Pfam; PF03914; CBF; 1. 

DR PROSITE; PS00639; THIOL PROTEASE_HIS ; 1. 

SQ SEQUENCE 822 AA; 944?2 MW; B6B724F9F83A4BC8 CRC64 ; 

Query Match 75.0%; Score 6; DB 5; Length 822; 

Best Local Similarity 100.0%; Pred. No. 1.6e+02; 
Matches 6; Conservative 0; Mismatches 0; Indels 

Qy 2 VNLDAE 7 

1 II I I I 

D b 449 VNLDAE 4 54 



RESULT 22 
Q961M8 
ID 



DT 
DT 



Q961M8 PRELIMINARY; PRT ; 839 AA. 

AC Q961M8; 

DT 01-DEC-2001 (TrEMBLrel. 19, Created) 

01-DEC-2001 (TrEMBLrel . 19, Last sequence update) 
01-OCT-2003 (TrEMBLrel. 25, Last annotation update) 

DE GHl5453p. 

GN DHC64C OR CG7507. 

OS Drosophila melanogaster (Fruit fly). D . ... 

OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota, 

OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; 

OC Ephydroidea; Drosophilidae ; Drosophila. 

OX NCBI_TaxID=7227 ; 

RN [1] 

RP SEQUENCE FROM N.A. 

RC STRAIN=Berkeley; T 

RA Stapleton M. , Brokstein P., Hong L., Agbayani A. , C rlso 

RA Champe M. , Chavez C, Dorsett V., Farfan D., Frise E George R 

KA unamp^ , . p L±ao G Miranda A., Mungall C.J 

RA Gonzalez M. , Guarin H . , JjI ^ • / j-.x<*w o , „ w 

RA Nunoo J. , Pacleb J., Paragas V., Park S., Phouanenavong S., Wan 

RA Yu C Lewis S.E., Rubin G.M., Celniker S.; 

RL Submitted (AUG-2001) to the EMBL/GenBank/DDB J databases. 

DR EMBL; AY051501; AAK92925.1; 

DR FlyBase; FBgn0010349; Dhc64C. 

DR GO; GO: 0045169; C:fusome; NAS . 

DR GO; GO: 0008567; F:dynein ATPase activity; IDA. 

DR GO; GO: 0003777; F : microtubule motor activity; IDA. 



DR GO; GO:0007098; Prcentrosome cycle; IMP. 

DR GO; GO: 0007292; P : female gamete generation; IMP. 

DR GO; GO: 0045478; P : fusome organization and biogenesis; NAS . 

DR GO; GO: 0007018; P : microtubule-based movement; IDA. 

DR GO; GO:0007294; P:oocyte cell fate determination (sensu Insecta); IMP 

DR InterPro; IPR004273; Dynein_heavy . 

DR Pfam; PF03028; Dynein_heavy ; 1. 

SQ SEQUENCE 839 AA; 94472 MW; 03 8B8 1350D7 06C0C CRC64; 

Query Match 75.0%; Score 6; DB 5; Length 839; 

Best Local Similarity 100.0%; Pred. No. 1.6e+02; 

Matches 6; Conservative 0; Mismatches 0; Indels 0; Gaps 

Qy 3 NLDAEF 8 

MINI 

Db 8 5 NLDAEF 90 



RESULT 23 
Q9KRY9 

ID Q9KRY9 PRELIMINARY; PRT; 8 68 AA. 

AC Q9KRY9; 

DT 01-OCT-2000 (TrEMBLrel. 15, Created) 

DT 01-OCT-2000 (TrEMBLrel. 15, Last sequence update) 

DT 01-OCT-2003 {TrEMBLrel. 25, Last annotation update) 

DE Aminopeptidase N. 

GN VC14 94. 

OS Vibrio cholerae. 

OC Bacteria; Proteobacteria ; Gammaproteobacteria ; Vibrionales; 

OC Vibrionaceae ; Vibrio. 

OX NCBI_TaxID=666; 
RN [1] 

RP SEQUENCE FROM N.A. 

RC STRAIN=El Tor N16961 / Serotype Ol; 

RX MEDLINE=2 04 0 6833; PubMed=10 952301 ; 

RA Heidelberg J.F., Eisen J. A. , Nelson W.C., Clayton R.A. , Gwinn M.L., 

RA Dodson R.J., Haft D.H., Hickey E.K., Peterson J.D., Umayam L.A. , 

RA Gill S.R., Nelson K.E., Read T.D., Tettelin H., Richardson D., 

RA Ermolaeva M.D., Vamathevan J., Bass S., Qin H., Dragoi I., Sellers P 

RA McDonald L., Utterback T., Fleischmann R.D., Nierman W.C., White O., 

RA Salzberg S.L., Smith H.O., Colwell R.R., Mekalanos J.J., Venter J.C. 

RA Fraser C .M. ; 

RT "DNA sequence of both chromosomes of the cholera pathogen Vibrio 

RT cholerae . " ; 

RL Nature 4 06:477-483(2000). 

DR EMBL; AE004227; AAF94649.1; -. 

DR PIR; G82193; G82193. 

DR MEROPS; M01.005; -. 

DR TIGR; VC1494; -. 

DR GO; GO: 0004179; F:membrane alanyl aminopeptidase activity; IEA. 

DR GO; GO: 0008237; F : metal lopeptidase activity; IEA. 

DR GO; GO: 0008270; F: zinc ion binding; IEA. 

DR GO; GO: 0006508; P : proteolysis and peptidolysis ; IEA. 

DR InterPro; IPR001930; Peptidase_Ml . 

DR InterPro; IPR006025; Pept_M_Zn_BS . 

DR Pfam; PF01433; Peptidase_Ml ; 1. 

DR PRINTS; PR00756; ALADI PTASE . 



DR PROSITE; PS00142; ZINC_PROTEASE; 1. 
KW Complete proteome . 

SQ SEQUENCE 868 AA; 98404 MW; 8 1A5 6B4B7A0 0A82F CRC64; 

Query Match 75.0%; Score 6; DB 16; Length 868; 

Best Local Similarity 100.0%; Pred. No. 1.7e+02; 

Matches 6; Conservative 0; Mismatches 0; Indels 0; Gaps 

Qy 3 NLDAEF 8 

I I I i I I 

Db 606 NLDAEF 611 



RESULT 24 
Q7WMP8 

ID Q7WMP8 PRELIMINARY; PRT; 894 AA. 

AC Q7WMP8; 

DT 01-OCT-2003 (TrEMBLrel. 25, Created) 

DT 01-OCT-2003 (TrEMBLrel. 25, Last sequence update) 

DT 01-OCT-2003 (TrEMBLrel. 25, Last annotation update) 

DE Aconitate hydratase {EC 4.2.1.3). 

GN ACN OR ACNA OR BB1342 . 

OS Bordetella bronchiseptica (Alcaligenes bronchisepticus ) . 

OC Bacteria; Proteobacteria ; Betaproteobacteria ; Burkholderiales ; 

OC Alcaligenaceae ; Bordetella. 

OX NCBI_TaxID=518 ; 

RN [1] 

RP SEQUENCE FROM N.A. 

RC STRAIN=RB5 0 / ATCC BAA-58 8; 

RX MEDLINE=22 827 954 ; PubMed=12 91 027 1 ; 

RA Parkhill J., Sebaihia M. , Preston A. r Murphy L.D., Thomson N., 

RA Harris D.E., Holden M.T.G., Churcher CM., Bentley S.D., Mungall K.L. 

RA Cerdeno-Tar raga A.M., Temple L., James K. , Harris B., Quail M.A. , 

RA Achtman M. , Atkin R. , Baker S., Basham D., Bason N., Cherevach I., 

RA Chillingworth T., Collins M. , Cronin A., Davis P., Doggett J., 

RA Feltwell T., Goble A., Hamlin N., Hauser H., Holroyd S., Jagels K. , 

RA Leather S., Moule S., Norberczak H., O'Neil S., Ormond D . , Price C, 

RA Rabbinowits ch E., Rutter S., Sanders M. , Saunders D., Seeger K., 

RA Sharp S., Sirnmonds M. , Skelton J., Squares R. , Squares S., Stevens K. 

RA Unwin L., Whitehead S., Barrell B.G., Maskell D.J.; 

RT "Comparative analysis of the genome sequences of Bordetella pertussis 

RT Bordetella parapertussis and Bordetella bronchiseptica."; 

RL Nat. Genet. 35:32-40(2 003). 

DR EMBL; BX640441; CAE31840.1; -. 

KW Lyase; Complete proteome. 

SQ SEQUENCE 894 AA; 96617 MW; 560A1DE58317C8 82 CRC64; 

Query Match 75.0%; Score 6; DB 16; Length 894; 

Best Local Similarity 100.0%; Pred. No. 1.7e+02; 

Matches 6; Conservative 0; Mismatches 0; Indels 0; Gaps 

Qy 3 NLDAEF 8 

I I I I I I 

Db 14 7 NLDAEF 152 



RESULT 2 5 



Q7WB80 

ID Q7WB8 0 PRELIMINARY; PRT; 8 94 AA. 

AC Q7WB80; 

DT 01-OCT-2003 (TrEMBLrel. 25, Created) 

DT 01-OCT-2003 (TrEMBLrel. 25, Last sequence update) 

DT Ol-OCT-2003 (TrEMBLrel. 25, Last annotation update) 

DE Aconitate hydratase (EC 4.2.1.3). 

GN ACN OR ACNA OR BPP1126. 

OS Bordetella parapertussis. 

OC Bacteria; Proteobacteria ; Betaproteobacteria ; Burkholderiales ; 

OC Alcaligenaceae ; Bordetella. 

OX NCBI_TaxID=519; 

RN [1] 

RP SEQUENCE FROM N.A. 

RC STRAIN=12822 / ATCC BAA-587; 

RX MEDLINE=22 82 7 954; PubMed-12 91027 1 ; 

RA Parkhill J . , Sebaihia M. , Preston A., Murphy L.D., Thomson N., 

RA Harris D.E., Holden M.T.G., Churcher CM., Bentley S.D., Mungall K.L. 

RA Cerdeno-Tarraga A.M., Temple L . , James K., Harris B., Quail M.A. , 

RA Achtman M. , Atkin R. , Baker S., Basham D . , Bason N., Cherevach I., 

RA Chillingworth T., Collins M. , Cronin A., Davis P., Doggett J., 

RA Feltwell T., Goble A., Hamlin N., Hauser H., Holroyd S., Jagels K. , 

RA Leather S., Moule S., Norberczak H., O'Neil S., Ormond D., Price C, 

RA Rabbinowitsch E., Rutter S., Sanders M. , Saunders D., Seeger K., 

RA Sharp S., Simmonds M. , Skelton J., Squares R. , Squares S., Stevens K. 

RA Unwin L., Whitehead S., Barrell B.G., Maskell D.J.; 

RT "Comparative analysis of the genome sequences of Bordetella pertussis 

RT Bordetella parapertussis and Bordetella bronchiseptica . " ; 

RL Nat. Genet. 35:32-40(2003). 

DR EMBL; BX640426; CAE36427.1; 

KW Lyase; Complete proteome. 

SQ SEQUENCE 894 AA; 96559 MW; D13 1C5250 10B3C7 5 CRC64; 

Query Match 75.0%; Score 6; DB 16; Length 894; 

Best Local Similarity 100.0%; Pred. No. 1.7e+02; 

Matches 6; Conservative 0; Mismatches 0; Indels 0; Gaps 

Qy 3 NLDAEF 8 

I I I I M 

Db 147 NLDAEF 152 



RESULT 2 6 
Q8DL57 

ID Q8DL57 PRELIMINARY; PRT; 1324 AA. 

AC Q8DL57; 

DT 01-MAR-2003 (TrEMBLrel. 23, Created) 

01-MAR-2003 (TrEMBLrel. 23, Last sequence update) 
Ol-OCT-2003 (TrEMBLrel. 25, Last annotation update) 
DE RNA polymerase beta prime subunit. 
GN RPOC2 OR TLL0640. 

Synechococcus elongatus (Thermosynechococcus elongatus). 
Bacteria; Cyanobacteria; Chroococcales ; Synechococcus . 
OX NCBI_TaxID=32 04 6; 
RN [1] 

RP SEQUENCE FROM N.A. 
RC STRAIN=BP-1; 



DT 
DT 



OS 
OC 



RX MEDLINE=2222 514 4; PubMed=1224 08 34 ; 

RA Nakamura Y. , Kaneko T., Sato S., Ikeuchi M. , Katoh H., Sasamoto S., 

RA Watanabe A., Iriguchi M. , Kawashima K., Kimura T . , Kishida Y. , 

RA Kiyokawa C, Kohara M. , Matsumoto M. , Matsuno A., Nakazaki N . , 

RA Shimpo S., Sugimoto M. , Takeuchi C, Yamada M. , Tabata S . ; 

RT "Complete genome structure of the thermophilic cyanobacterium 

RT Thermosynechococcus elongatus BP-1." ; 

RL DNA Res. 9:123-130(2002). 

DR EMBL; AP005371; BAC08191.1; -. 

DR GO; GO:0003899; F : DNA-directed RNA polymerase activity; IEA. 

DR GO; GO: 0006350; P: transcription; IEA. 

DR InterPro; IPR007066; RNA__pol_Rpbl_3 . 

DR InterPro; IPR007083; RNA_pol_Rpbl_4 . 

DR InterPro; IPR007081; RNA_pol_Rpbl_5 . 

DR Pfam; PF04983; RNA_pol_Rpbl_3 ; 1. 

DR Pfam; PF05000; RNA_pol_Rpbl_4 ; 1. 

DR Pfam; PF04998; RNA_pol_Rpbl_5 ; 1. 

KW Complete proteome. 

SQ SEQUENCE 1324 AA; 144701 MW; 2A82FE2C94B54 976 CRC64 ; 

Query Match 75.0%; Score 6; DB 16; Length 1324; 

Best Local Similarity 100.0%; Pred. No. 2.4e+02; 

Matches 6; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 



Qy 2 VNLDAE 7 

MINI 

Db 454 VNLDAE 459 



RESULT 2 7 
Q8INT5 

ID Q8INT5 PRELIMINARY; PRT; 1469 AA. 

AC Q8INT5; 

DT 01-MAR-2003 (TrEMBLrel . 23, Created) 

DT 01-MAR-2003 (TrEMBLrel. 23, Last sequence update) 

DT 01-JUN-2003 (TrEMBLrel. 24, Last annotation update) 

DE CG31623-PA. 

GN DTR OR CG31623. 

OS Drosophila melanogaster (Fruit fly) . 

OC Eukaryota; Metazoa; Arthropoda; Hexapoda ; Insects; Pterygota; 

OC Neoptera; Endopterygota ; Diptera; Brachycera; Muscomorpha; 

OC Ephydroidea; Drosophilidae ; Drosophila. 

OX NCBIJTaxID=7227; 

RN [1] 

RP SEQUENCE FROM N . A. 

RX MEDLINE=2 019 6006; PubMed=l 07 3 1132 ; 

RA Adams M.D., Celniker S.E., Holt R.A., Evans C.A., Gocayne J.D., 

RA Amanatides P.G., Scherer S.E., Li P.W., Hoskins R. A. , Galle R.F., 

RA George R.A. , Lewis S.E., Richards S., Ashburner M. , Henderson S.N., 

RA Sutton G.G., Wortman J.R., Yandell M.D., Zhang Q . , Chen L.X., 

RA Brandon R.C., Rogers Y.H., Blazej R.G., Champe M. , Pfeiffer B.D., 

RA Wan K.H., Doyle C, Baxter E.G., Helt G . , Nelson C.R., Gabor G.L., 

RA Abril J.F., Agbayani A., An H.J., Andrews-Pf annkoch C, Baldwin D., 

RA Ballew R.M. , Basu A., Baxendale J., Bayraktaroglu L., Beasley E.M., 

RA Beeson K.Y., Benos P.V., Berman B.P., Bhandari D., Bolshakov S., 

RA Borkova D , , Botchan M.R., Bouck J., Brokstein P., Brottier P., 

RA Burtis K.C., Busam D.A., Butler H., Cadieu E., Center A., Chandra I., 



RA Cherry J.M. , Cawley S., Dahlke C, Davenport L.B., Davies P., 

RA de Pablos B., Delcher A., Deng Z., Mays A.D., Dew I., Dietz S.M., 

RA Dodson K., Doup L.E., Downes M. , Dugan-Rocha S., Dunkov B.C., Dunn P., 

RA Durbin K.J., Evangelista C.C., Ferraz C, Ferriera S . , Fleischmann W., 

RA Fosler C, Gabrielian A.E., Garg N.S., Gelbart W.M., Glasser K., 

RA Glodek A., Gong F. , Gorrell J.H., Gu Z . , Guan P., Harris M. , 

RA Harris N . L . , Harvey D., Heiman T.J., Hernandez J.R., Houck J., 

RA Hostin D., Houston K.A., Howland T.J., Wei M.H., Ibegwam C, 

RA Jalali M., Kalush F., Karpen G.H., Ke Z., Kennison J. A., Ketchum K-A. , 

RA Kimmel B.E., Kodira CD., Kraft C, Kravitz S., Kulp D . , Lai Z., 

RA Lasko P., Lei Y., Levitsky A. A. , Li J., Li Z., Liang Y., Lin X., 

RA Liu X., Mattei B., Mcintosh T.C., McLeod M.P., McPherson D., 

RA Merkulov G., Milshina N.V., Mobarry C, Morris J., Moshrefi A., 

RA Mount S.M., Moy M. , Murphy B., Murphy L., Muzny D.M., Nelson D.L., 

RA Nelson D.R., Nelson K.A., Nixon K., Nusskern D.R., Pacleb J.M., 

RA Palazzolo M. , Pittman G.S., Pan S-, Pollard J., Puri V., Reese M.G., 

RA Reinert K., Remington K., Saunders R.D., Scheeler F. , Shen H. f 

RA Shue B.C., Siden-Kiamos I., Simpson M. , Skupski M.P., Smith T . , 

RA Spier E . , Spradling A.C., Stapleton M. , Strong R. , Sun E., 

RA Svirskas R. , Tector C, Turner R. , Venter E., Wang A.H., Wang X., 

RA Wang Z.Y., Wassarman D . A. , Weinstock G.M. , Weissenbach J., 

RA Williams S.M., WoodageT, Worley K.C., Wu D., Yang S., Yao Q.A., Ye J., 

RA Yeh R.F., Zaveri J.S., Zhan M. , Zhang G . , Zhao Q . , Zheng L., 

RA Zheng X.H., Zhong F.N., Zhong W., Zhou X., Zhu S., Zhu X., Smith H.O., 

RA Gibbs R.A., Myers E.W., Rubin G.M. , Venter J.C.; 

RT "The genome sequence of Drosophila melanogas ter . " ; 

RL Science 2 87:2185-2195(2000). 

RN [2] 

RP SEQUENCE FROM N.A. 

RA Celniker S.E., Adams M.D., Kronmiller B . , Wan K.H., Holt R.A., 

RA Evans C.A., Gocayne J.D., Amanatides P.G., Brandon R.C., Rogers Y., 

RA Banzon J., An H . , Baldwin D., Banzon J., Beeson K.Y., Busam D.A., 

RA Carlson J.W., Center A., Champe M. , Davenport L.B., Dietz S.M., 

RA Dodson K., Dorsett V., Doup L.E., Doyle C, Dresnek D., Farfan D., 

RA Ferriera S., Frise E., Galle R.F., Garg N.S., George R.A. , 

RA Gonzalez M., Houck J., Hoskins R.A. , Hostin D., Howland T.J., 

RA Ibegwam C, Jalali M. , Kruse D., Li P., Mattei B . , Moshrefi A., 

RA Mcintosh T.C., Moy M. , Murphy B., Nelson C, Nelson K.A., Nunoo J., 

RA Pacleb J., Paragas V., Park S., Patel S., Pfeiffer B . , 

RA Phouanenavong S., Pittman G.S., Puri V., Richards S. # Scheeler F. , 

RA Stapleton M. , Strong R. , Svirskas R. , Tector C, Tyler D., 

RA Williams S.M., Zaveri J.S., Smith H.O., Venter J.C., Rubin G.M.; 

RT "Sequencing of Drosophila melanogaster genome. "; 

RL Submitted (MAR-2000) to the EMBL/ GenBank/DDBJ databases. 

RN [3] 

RP SEQUENCE FROM N.A. 

RA Misra S., Crosby M. A., Matthews B.B., Bayraktaroglu L . , Campbell K . , 

RA Hradecky P., Huang Y., Kaminker J.S., Prochnik S.E., Smith CD., 

RA Tupy J.L., Bergman C, Berman B., Carlson J.W., Celniker S.E., 

RA Clamp M., Drysdale R. , Emmert D . , Frise E., de Grey A., Harris N., 

RA Kronmiller B., Marshall B., Millburn G. , Richter J., Russo S., 

RA Searle S.M.J. , Smith E . , Shu S., Smutniak F., Whitfield E., 

RA Ashburner M. , Gelbart W.M., Rubin G.M., Mungall C.J., Lewis S.E.; 

RT "Annotation of Drosophila melanogaster genome."; 

RL Submitted (MAR-2000) to the EMBL/ GenBank/DDBJ databases. 

RN [4] 

RP SEQUENCE FROM N.A. 



RA Adams M.D., Celniker S.E., Gibbs R.A. , Rubin G.M., Venter C.J.; 

RL Submitted (MAR-2000) to the EMBL/GenBank/DDB J databases. 

RN [5] 

RP SEQUENCE FROM N.A. 

RA FlyBase; 

RL Submitted (SEP-2002) to the EMBL/GenBank/DDB J databases. 

DR EMBL; AE003670; AAN11119.1; 

DR FlyBase; FBgn0023090; dtr. 

DR InterPro; IPR001611; LRR. 

DR InterPro; IPR007092; LRR_SDS22 . 

DR Pfam; PF00560; LRR; 4. 

SQ SEQUENCE 1469 AA; 166591 MW; D42 1 8 12 69A4 651E1 CRC64; 

Query Match 75.0%; Score 6; DB 5; Length 1469; 

Best Local Similarity 100.0%; Pred. No. 2.7e+02; 

Matches 6; Conservative 0; Mismatches 0; Indels 0; Gaps 



Qy 3 NLDAEF 8 

I I I 1 I I 

Db 1016 NLDAEF 1021 



RESULT 2 8 
Q9VZ8 3 

ID Q9VZ83 PRELIMINARY; PRT; 4639 AA. 

AC Q9VZ83; 

DT 01-MAY-2000 (TrEMBLrel . 13, Created) 

DT 01-OCT-2002 (TrEMBLrel. 22, Last sequence update) 

DT 01-OCT-2003 (TrEMBLrel. 25, Last annotation update) 

DE CG7507 protein. 

GN DHC64C OR CG7507 . 

OS Drosophila melanogaster (Fruit fly) . 

OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; 

OC Neoptera; Endopterygota ; Diptera; Brachycera; Muscomorpha; 

OC Ephydroidea; Drosophilidae ; Drosophila. 

OX NCBI_TaxID=7227 ; 

RN [1] 

RP SEQUENCE FROM N.A. 

RC STRAIN=Berkeley; 

RX MEDLINE=20196006; PubMed=l 073 1132 ; 

RA Adams M.D., Celniker S.E., Holt R.A. , Evans C.A., Gocayne J.D., 

RA Amanatides P.G., Scherer S.E., Li P.W., Hoskins R.A. , Galle R.F., 

RA George R.A. , Lewis S.E., Richards S., Ashburner M., Henderson S.N., 

RA Sutton G.G., Wortman J.R., Yandell M.D., Zhang Q . , Chen L.X., 

RA Brandon R.C., Rogers Y.-H.C, Blazej R.G., Champe M., Pfeiffer B.D., 

RA Wan K.H., Doyle C, Baxter E.G., Helt G., Nelson C.R., Miklos G.L.G. 

RA Abril J.F., Agbayani A., An H.-J., Andrews-Pf annkoch C, Baldwin D., 

RA Ballew R.M. , Basu A., Baxendale J., Bayraktaroglu L., Beasley E.M., 

RA Beeson K.Y., Benos P.V., Berman B.P., Bhandari D., Bolshakov S., 

RA Borkova D . , Botchan M.R., Bouck J., Brokstein P., Brottier P., 

RA Burtis K.C., Busam D.A., Butler H., Cadieu E . , Center A., Chandra I. 

RA Cherry J.M., Cawley S., Dahlke C, Davenport L.B., Davies P., 

RA de Pablos B., Delcher A., Deng Z., Mays A.D., Dew I., Dietz S.M., 

RA Dodson K. , Doup L.E., Downes M. , Dugan-Rocha S., Dunkov B.C., Dunn P 

RA Durbin K.J., Evangelista C.C., Ferraz C, Ferriera S., Fleischmann W 

RA Fosler C, Gabrielian A.E., Garg N.S., Gelbart W.M., Glasser K. , 

RA Glodek A., Gong F . , Gorrell J.H., Gu Z., Guan P., Harris M. , 



RA Harris N.L., Harvey D., Heiman T.J., Hernandez J.R., Houck J., 

RA Hostin D . , Houston K.A. , Howland T.J., Wei M.-H., Ibegwam C, 

RA Jalali M. , Kalush F. , Karpen G.H., Ke Z., Kennison J. A., Ketchum K.A. , 

RA Kimmel B.E., Kodira CD., Kraft C, Kravitz S., Kulp D., Lai Z., 

RA Lasko P., Lei Y . , Levitsky A. A. , Li J., Li Z., Liang Y . , Lin X., 

RA Liu X., Mattei B., Mcintosh T.C, McLeod M.P., McPherson D., 

RA Merkulov C, Milshina N.V., Mobarry C, Morris J., Moshrefi A., 

RA Mount S.M., Moy M . , Murphy B . , Murphy L., Muzny D.M., Nelson D.L., 

RA Nelson D.R., Nelson K.A. , Nixon K., Nusskern D.R., Pacleb J.M., 

RA Palazzolo M. , Pittman G.S., Pan S., Pollard J., Puri V., Reese M.C, 

RA Reinert K. , Remington K., Saunders R.D.C, Scheeler F . , Shen H. , 

RA Shue B.C., Siden-Kiamos I., Simpson M. , Skupski M.P., Smith T., 

RA Spier E., Spradling A. C , Stapleton M. , Strong R. , Sun E . , 

RA Svirskas R. , Tector C, Turner R. , Venter E. f Wang A.H., Wang X., 

RA Wang Z.-Y., Wassarman D.A., Weinstock CM., Weissenbach J., 

RA Williams S.M., Woodage T., Worley K.C., Wu D., Yang S., Yao Q.A. f 

RA Ye J. , Yeh R.-F., Zaveri J.S., Zhan M. , Zhang G., Zhao Q., Zheng L., 

RA Zheng X.H., Zhong F.N., Zhong W., Zhou X., Zhu S., Zhu X., Smith H.O., 

RA Gibbs R.A., Myers E.W., Rubin CM., Venter J.C; 

RT "The genome sequence of Drosophila melanogas ter . " ; 

RL Science 287:2185-2195(2000). 

RN [2] 

RP SEQUENCE FROM N.A. 

RA Celniker S.E., Adams M.D., Kronmiller B., Wan K.H., Holt R.A. , 

RA Evans C.A., Gocayne J.D., Amanatides P.C, Brandon R.C, Rogers Y., 

RA Banzon J., An H . , Baldwin D., Banzon J., Beeson K.Y., Busam D.A., 

RA Carlson J.W. , Center A., Champe M. , Davenport L.B., Dietz S.M., 

RA Dodson K., Dorsett V., Doup L.E., Doyle C, Dresnek D., Farfan D., 

RA Ferriera S., Frise E., Galle R.F., Garg N.S., George R.A., 

RA Gonzalez M. , Houck J., Hoskins R.A. , Hostin D., Howland T.J., 

RA Ibegwam C, Jalali M. , Kruse D., Li P., Mattei B., Moshrefi A., 

RA Mcintosh T.C, Moy M. , Murphy B., Nelson C, Nelson K.A. , Nunoo J., 

RA Pacleb J., Paragas V., Park S., Patel S., Pfeiffer B. , 

RA Phouanenavong S., Pittman G.S., Puri V., Richards S., Scheeler F. , 

RA Stapleton M. , Strong R. , Svirskas R. , Tector C, Tyler D., 

RA Williams S.M., Zaveri J.S., Smith H.O., Venter J.C, Rubin CM. ; 

RT "Sequencing of Drosophila melanogaster genome."; 

RL Submitted (MAR-2000) to the EMBL/ GenBank/DDBJ databases. 

RN [3] 

RP SEQUENCE FROM N.A. 

RA Misra S., Crosby M.A. , Matthews B.B., Bayraktaroglu L . , Campbell K . , 

RA Hradecky P., Huang Y., Kaminker J.S., Prochnik S.E., Smith CD., 

RA Tupy J.L., Bergman C, Berman B., Carlson J.W., Celniker S.E., 

RA Clamp M. , Drysdale R. , Emmert D., Frise E., de Grey A., Harris N . , 

RA Kronmiller B., Marshall B., Millburn C, Richter J., Russo S., 

RA Searle S.M.J., Smith E . , Shu S., Smutniak F., Whitfield E . , 

RA Ashburner M. , Gelbart W.M., Rubin G.M., Mungall C.J., Lewis S.E.; 

RT "Annotation of Drosophila melanogaster genome."; 

RL Submitted (MAR-2000) to the EMBL/ GenBank/DDBJ databases. 

RN [4] 

RP SEQUENCE FROM N.A. 

RA Adams M.D., Celniker S.E., Gibbs R.A. , Rubin G.M., Venter C.J.; 

RL Submitted (MAR-2000) to the EMBL/ GenBank/ DDB J databases. 
RN [5] 

RP SEQUENCE FROM N.A. 

RA FlyBase; 

RL Submitted (SEP-2002) to the EMBL/ GenBank/ DDB J databases. 



DR EMBL; AE003482; AAF47942.3; -. 
DR FlyBase; FBgn0010349; Dhc64C. 



DR 


GO; 


GO: 


0045169; 


C 


: f usome ; NAS . 


DR 


GO; 


GO: 


0008567; 


F 


rdynein ATPase activity; IDA. 


DR 


GO; 


GO: 


0003777; 


F 


microtubule motor activity; IDA. 


DR 


GO; 


GO: 


0007098; 


P 


:centrosome cycle; IMP. 


DR 


GO; 


GO: 


0007292; 


P 


: female gamete generation; IMP. 


DR 


GO; 


GO: 


0045478; 


P 


:f usome organization and biogenesis; NAS. 


DR 


GO; 


GO: 


0007018; 


P 


: microtubule-based movement; IDA. 


DR 


GO; 


GO: 


0007294; 


P 


: oocyte cell fate determination (sensu Insecta) ; IMP. 



DR InterPro; IPR003593; AAA_ATPase. 

DR InterPro; IPR001064; Crystallin. 

DR InterPro; IPR004273; Dynein_heavy . 

DR InterPro; IPR000169; SHprot_acsi te . 

DR Pfam; PF03028 ; Dynein_heavy ; 1 . 

DR SMART; SM00382; AAA; 4. 

DR PROSITE; PS00225; C R Y S T AL L I N_B ET AGAMMA ; 1 . 

DR PROSITE; PS00639; THIOL_PROTEASE_HI S ; 1. 

KW ATP-binding . 

SQ SEQUENCE 4639 AA; 530172 MW; 2A3D4 F1F8BE52 096 CRC64; 

Query Match 75.0%; Score 6; DB 5; Length 4639; 

Best Local Similarity 100.0%; Pred. No. 7.2e+02; 

Matches 6; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 

Qy 3 NLDAEF 8 

I I I II I 

Db 3885 NLDAEF 3890 



RESULT 2 9 
Q8CG65 



ID 
AC 
DT 
DT 
DT 
DE 
GN 
OS 
OC 
OC 
OX 
RN 
RP 
RC 
RA 
RA 
RT 
RT 
RL 
DR 



PRELIMINARY; 



PRT; 4998 AA. 



(TrEMBLrel . 
(TrEMBLrel . 
(TrEMBLrel . 



23, Created) 

23, Last sequence update) 
25, Last annotation update) 



Q8CG65 
Q8CG65; 
01-MAR-2003 
01-MAR-2003 
01-OCT-2003 
SCO-spondin . 
SCO-SPONDIN. 
Mus musculus (Mouse) 
Eukaryota ; Metazoa ; 
Mammalia; Eutheria ; 
NCBI_TaxID=10090; 

[1] 

SEQUENCE FROM N . A. 
STRAIN=ICR; TISSUE=Brain; 
Goncalves N., Simon-Chazottes 
Guenet J-L., Meiniel R.; 

"Characterization, spatio-temporal expression and chromosomal 
assignment of mouse SCO-spondin . " ; 

Submitted (JUN-2002) to the EMBL/ GenBank/DDBJ databases. 



Chordata ; 
Rodent ia ; 



Craniata; Vertebrata; Euteleos tomi ; 
Sciurognathi ; Muridae; Murinae; Mus. 



D. 



Creveaux I 



Meiniel A. 



EMBL; AJ491857 



CAD42654 . 1; 



DR 


GO; 


GO: 


0005576; 


C: 


extracellular; IEA. 


DR 


GO; 


GO: 


0005507; 


F: 


copper ion binding; IEA. 


DR 


GO; 


GO: 


0005489; 


F: 


electron transporter activity; 


DR 


GO; 


GO: 


0005179; 


F: 


hormone activity; IEA. 


DR 


GO; 


GO: 


0007155; 


P: 


cell adhesion; IEA. 



IEA. 



GO; GO: 0006118; Prelectron transport; TEA. 



DR 

DR InterPro; 

DR InterPro; 

DR InterPro; 

DR InterPro; 

DR InterPro; 

DR InterPro; 

DR InterPro; 

DR InterPro; 

DR InterPro; 

DR InterPro; 

DR InterPro; 

DR InterPro ; 

DR Pfam; 

DR Pfam; 

DR Pfam; 

DR Pfam; 

DR Pfam; 

DR Pfam; 

DR PRINTS 

DR PRINTS 

DR SMART ; 

DR SMART ; 

DR SMART ; 

DR SMART; 

DR SMART; 

DR PROSITE 

DR PROSITE 

DR PROSITE 

DR PROSITE 

DR PROSITE 

DR PROSITE 

DR PROSITE 

DR PROSITE 

DR PROSITE. 

DR PROSITE; 

SQ SEQUENCE 



IPR000923 
IPR006207 
IPR006209 
IPR000421 
IPR001545 
IPR002172 
IPR002919 
IPR000884 
IPR008085 
IPR006552 
IPR001007 
IPR001846 
F5 F8 



BlueCu_l . 
Cys__knot_C . 
EGF_like . 
FA5 8_C. 
Gly_hormoneB . 
LDL_receptor_A. 
TIL_Cysrich . 
TSP1 . 



PF00754 
PF00057 
PF01826 
PF00090 
PF00093 
PF00094 ; 
; PR00261 
; PR01705 
SM00192 
SM00209 
SM00214 
SM00215 
SM00216; 
PS00196 
PS01225 
PS01186 
PS50022 
PS00261 
PS01209 
PS50068 
PS50092 
PS01208 
PS50184 
4998 



TSP_1 . 
VC_out . 
VWF_C . 
VWF_D . 
_ _J:ype_C; 1. 
ldl_recept_a; 10. 
TIL; 10. 
tsp_l; 25. 
vwc ; 1 . 
vwd ; 3 . 

; LDLRECEPTOR. 
; TSP1REPEAT . 
LDLa; 10. 
TSP1; 25. 
VWC ; 4 . 
VWC_out ; 9 . 
VWD ; 2 . 

COPPER_BLUE; 1 
CTCK_2 ; 1 . 
EGF_2 ; 2 . 
FA58C_3; 1. 
GLYCO_HORMONE_ 
LDLRA_1; 8. 
LDLRA_2; 10. 
TSP1; 25. 
VWFC_1 ; 1 . 
VWFC_2 ; 2 . 
AA; 535019 MW; 



BETA 1; 2 



DA2ABA8 DA4 7 DF2 2 5 CRC64; 



Query Match 75.0%; Score 6; DB 11; Length 4998; 

Best Local Similarity 100.0%; Preci. No. 7.7e+02; 

Matches 6; Conservative 0; Mismatches 0; Indels 



0; Gaps 



0; 



Qy 

Db 



3 NLDAEF 8 

I I I I I I 
2 8 63 NLDAEF 2 8 68 



RESULT 
Q8 8UM4 
ID 
AC 
DT 
DT 
DT 
DE 
GN 



30 



PRELIMINARY; 



OS 



Q8 8UM4 
Q8 8UM4 ; 
01- JUN-2003 
01-JUN-2003 
01-JUN-2003 
Prophage Lp2 protein 
LP_244 6. 
Lactobacillus plantarum 



(TrEMBLrel. 24, 
(TrEMBLrel. 24, 
(TrEMBLrel. 24, 
11 . 



PRT; 



46 AA. 



Created) 

Last sequence update) 
Last annotation update) 



OC Bacteria; Firmicutes ; Lactobacillales ; Lactobacillaceae; 

OC Lactobacillus. 

OX NCBI_TaxID=1590 ; 

RN [1] 

RP SEQUENCE FROM N.A. 

RC STRAIN=NCIMB 8826 / WCFS1; 

RX MEDLINE=224 8 029 6; PubMed=125 665 66 ; 

RA Kleerebezem M. , Boekhorst J., van Kranenburg R. , Molenaar D., 

RA Kuipers O.P., Leer R. , Tarchini R. , Peters S.A. , Sandbrink H.M., 

RA Fiers M.W.E.J., Stiekema W., Klein Lankhorst R.M., Bron P. A., 

RA Hoffer S.M., Nierop Groot M.N., Kerkhoven R. , De Vries M. , Ursing B. 

RA De Vos W.M., Siezen R.J.; 

RT "Complete genome sequence of Lactobacillus plantarum WCFS1-"; 

RL Proc. Natl. Acad. Sci . U.S.A. 100:1990-19 95(2 003). 

DR EMBL; AL935259; CAD64752.1; -. 

KW Complete proteome . 

SQ SEQUENCE 46 AA; 5193 MW; 9E0025 9EEEA5 9996 CRC64; 



Query Match 62.5%; Score 5; DB 16; Length 46; 

Best Local Similarity 100.0%; Pred. No. 1 . 6e+02 ; 

Matches 5; Conservative 0; Mismatches 0; Indels 0; Gaps 



Qy 4 LDAEF 8 

EMM 

Db 33 LDAEF 37 



RESULT 31 
Q9PVI4 

ID Q9PVI4 PRELIMINARY; PRT; 49 AA. 

AC Q9PVT4; 

DT 01-MAY-2000 (TrEMBLrel. 13, Created) 

DT 01-MAY-2000 (TrEMBLrel. 13, Last sequence update) 

DT 01-JUN-2003 (TrEMBLrel. 24, Last annotation update) 

DE Regulator of G-protein signalling 4 (Fragment). 

OS Gallus gallus (Chicken) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Archosauria; Aves ; Neognathae; Galliformes; Phasianidae; Phasianinae 

OC Gallus . 

OX NCBI_TaxID=9031; 

RN [1] 

RP SEQUENCE FROM N.A. 

RC TISSUE=Dorsal root ganglion; 

RA Schiff M. , Jordan J.D., Ortiz D.F., De Vries L . , Fischer T., 

RA Farquhar M.G., Diverse-Pierluis si M. ; 

RT "Selective effects of GAIP and RGS4 on desensitization of neuronal 

RT signalling pathways."; 

RL Submitted (SEP-1998) to the EMBL/ GenBank/ DDB J databases. 

DR EMBL; AF090091; AAF00033.1; 

DR HSSP; P49799; 1AGR. 

DR GO; GO: 0004871; F: signal transducer activity; IEA. 

DR InterPro; IPR000342; Regl_Gprotein . 

DR Pfam; PF00615; RGS; 1. 

DR PROSITE; PS50132; RGS; 1. 

FT NON_TER 1 1 

FT NONJTER 4 9 4 9 

SQ SEQUENCE 49 AA; 5701 MW; 0E6F58 02FAF55973 CRC64; 



Query Match 62.5%; Score 5; DB 13; Length 49; 

Best Local Similarity 100.0%; Pred. No. 1.7e+02; 

Matches 5; Conservative 0; Mismatches 0; Indels 



0; Gaps 



0; 



Qy 1 EVNLD 5 

Mill 

Db 10 EVNLD 14 



RESULT 32 
Q8KM85 

ID Q8KM85 PRELIMINARY; PRT; 50 AA. 

AC Q8KM85; 

DT 01-OCT-2002 (TrEMBLrel. 22, Created) 

DT 01-OCT-2002 (TrEMBLrel. 22, Last sequence update) 

DT 01-JUN-2003 (TrEMBLrel. 24, Last annotation update) 

DE Hypothetical protein. 

OS Mycoplasma suis . 

OC Bacteria; Firmicutes; Mollicutes ; Mycoplasmataceae ; Mycoplasma. 

OX NCBI_TaxID=57372 ; 

RN [1] 

RP SEQUENCE FROM N.A. 

RC STRAIN=54/96; 

RA Hoelzle L.E., Adelt D., Hoelzle K. , Heinritzi K. , Wittenbrink M.M.; 

RT "Purification and analysis of Mycoplasma suis (Eperythrozoon suis) DNA 

RT from porcine blood."; 

RL Submitted (AUG-2002) to the EMBL/ GenBank/DDB J databases. 

DR EMBL; AJ504999; CAD44546.1; -. 

KW Hypothetical protein. 

SQ SEQUENCE 50 AA; 5767 MW; 93B27 4 5C684D6547 CRC64 ; 

Query Match 62.5%; Score 5; DB 2; Length 50; 

Best Local Similarity 100.0%; Pred. No. 1.7e+02; 

Matches 5; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 

Qy 3 NLDAE 7 

MM! 

Db 26 NLDAE 30 



RESULT 33 


Q8CKW2 


ID 


Q8CKW2 PRELIMINARY; PRT; 55 AA. 


AC 


Q8CKW2; 


DT 


01-MAR-2003 (TrEMBLrel. 23, Created) 


DT 


01-MAR-2003 (TrEMBLrel. 23, Last sequence update) 


DT 


01-MAR-2003 (TrEMBLrel. 23, Last annotation update) 


DE 


Hypothetical . 


GN 


Y2748. 


OS 


Yersinia pestis . 


OC 


Bacteria; Proteobacteria; Gammaproteobacteria ; Enterobacteriales; 


OC 


Enterobacteriaceae ; Yersinia. 


ox 


NCBI TaxID=632; 


RN 


[1] 


RP 


SEQUENCE FROM N.A. 


RC 


STRAIN=KIM5 / Biovar Mediaevalis; 



RX MEDLINE=22137 8 63; PubMed=12 14243 0 ; 

RA Deng W. , Burland V., Plunkett G . Ill, Boutin A., Mayhew G.F., Liss P 

RA Perna N . T . , Rose D.J., Man B., Zhou S., Schwartz D.C., 

RA Fetherston J.D., Lincller L.E., Brubaker R.R., Piano G.V., 

RA Straley S.C., McDonough K.A. , Nilles M.L., Matson J.S., Blattner F . R 

RA Perry R. D . ; 

RT "Genome sequence of Yersinia pestis KIM." ; 

RL J. Bacteriol. 184:4 601-4611(2002). 

DR EMBL; AE013877; AAM86301.1; -. 

KW Hypothetical protein. 

SQ SEQUENCE 55 AA; 5908 MW; FBC89479AC2DCB8 1 CRC64 ; 

Query Match 62.5%; Score 5; DB 16; Length 55; 

Best Local Similarity 100.0%; Pred. No. 1.9e+02; 

Matches 5; Conservative 0; Mismatches 0; Indels 0; Gaps 

Qy 2 VNLDA 6 

Mill 

Db 9 VNLDA 13 



RESULT 
Q8WYE8 



34 



ID 
AC 
DT 
DT 
DT 
DE 
GN 
OS 
OC 
OC 
OX 
RN 
RP 
RA 
RL 
RN 
RP 
RA 
RL 
RN 
RP 
RA 
RL 
DR 
FT 
FT 
SQ 



PRELIMINARY; 



Q8WYE8 
Q8WYE8; 

01-MAR-2 0 02 (TrEMBLrel 
01-MAR-2 0 02 (TrEMBLrel 
01- JUN-2 0 03 (TrEMBLrel 
Dystrophin (Fragment) . 
DMD. 

Homo sapiens (Human) . 
Eukaryota; Metazoa; Chordata; 
Mammalia; Eutheria; Primates; 
NCBI_TaxID=9 60 6; 
[1] 

SEQUENCE FROM N.A. 

Beggs A.H., Koenig M. , Kunkel 



PRT; 



63 AA. 



20, Created) 

2 0, Last sequence update) 
24, Last annotation update) 



Craniata; Vertebrata; Euteleos tomi ; 
Catarrhini; Hominidae; Homo. 



L.M. 



(DEC-1999) to the EMBL/ GenBank/DDBJ databases. 



Submitted 
[2] 

SEQUENCE FROM N.A. 
Worton R . G . ; 
Submitted (DEC-19 99) 
[3] 

SEQUENCE FROM N.A. 
den Dunnen J.T.; 
Submitted (DEC-19 99) 
EMBL; AF213410; AAL61558 
NON_TER 1 1 

NON_TER 63 63 

SEQUENCE 63 AA; 7147 MW; 

Query Match 62.5%; 
Best Local Similarity 100.0% 
Matches 5; Conservative 



to the EMBL/ GenBank/DDBJ databases. 



to 



the 
.1; 



EMBL/ GenBank/DDBJ databases. 



C57D0C273C1E8A82 CRC64 ; 

Score 5; DB 4; Length 63; 
Pred. No. 2.1e+02; 
0; Mismatches 0; Indels 



0 ; Gaps 



Qy 



1 EVNLD 5 

Mill 



Db 



18 EVNLD 22 



RESULT 
Q96X11 



35 



ID 
AC 
DT 
DT 
DT 
DE 
OS 

oc 
oc 
ox 

RN 
RP 
RC 
RA 
RT 
RL 
DR 
DR 
DR 
DR 
DR 
DR 
DR 
DR 
DR 
DR 
DR 
DR 
FT 
FT 
SQ 



Q96X11 
Q96X11; 
01-DEC-2001 
01-DEC-2001 
01-OCT-2003 
Mating type 



PRELIMINARY; 



PRT; 



66 AA. 



(TrEMBLrel . 19, 
(TrEMBLrel . 19, 
(TrEMBLrel . 25, 



Created) 

Last sequence update ) 
Last annotation update) 
2 protein (Fragment) . 
Phaeosphaeria avenaria f . sp . triticae . 

Eukaryota; Fungi; Ascomycota; Pezizomycotina; Dothideomycetes ; 
Pleosporales ; Phaeosphaeriaceae ; Phaeosphaeria. 
NCBI_TaxID=54790; 
[1] 

SEQUENCE FROM N.A. 

STRAIN=ATCC2 637 0; 

Dai Q. , Cui K., Ueng P.P.; 

"Mating type gene in Phaeosphaeria avenaria f. sp . triticae."; 
Submitted (MAR-2001) to the EMBL/ GenBank/DDB J databases. 
EMBL; AF358826 



GO; 


GO: 


0016021; 


GO; 


GO: 


0003677; 


GO; 


GO: 


0001584; 


GO; 


GO: 


0007186; 


GO; 


GO: 


0006355; 



IEA. 



AAK51442.1; -. 
C: integral to membrane; 
F : DNA binding; IEA. 

F : rhodop sin-like receptor activity; IEA. 
P:G-protein coupled receptor protein signalin. . .; 
P : regulation of transcription, DNA-dependent ; IEA. 
Inter Pro; IPR0 0 027 6; GPCR_Rhodpsn . 
InterPro; IPR000910; HMG_12_box. 
Pfam; PF00505; HMG_box; 1. 
SMART; SM0039 8; HMG; 1. 
PROSITE; PS00237; G PROTEIN 
PS50118; 
1 
66 
66 AA; 



IEA. 



PROSITE; 
NON_TER 
NON_TER 
SEQUENCE 



HMG_BOX 
1 
66 

7936 MW; 



_RECEP 
2; 1 . 



Fl 1; 1. 



532A84F37D334DBA CRC64; 



Qy 

Db 



Query Match 62.5%; Score 5; DB 3; Length 66; 

Best Local Similarity 100.0%; Pred. No. 2.2e+02; 

Matches 5; Conservative 0; Mismatches 0; Indels 

3 NLDAE 7 



0; Gaps 



0; 



3 5 NLDAE 3 9 



RESULT 3 6 
Q9TU51 

ID Q9TU51 PRELIMINARY; PRT; 67 AA. 

AC Q9TU51; 

DT 01-MAY-2000 (TrEMBLrel. 13, Created) 
DT 01-MAY-2000 (TrEMBLrel. 13, Last sequence update) 
DT 01-JUN-2003 (TrEMBLrel. 24, Last annotation update) 
DE Regulator of G-protein signalling 16 (Fragment) . 
OS Sus scrofa (Pig) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleos tomi ; 
OC Mammalia; Eutheria; Cetartiodactyla ; Suina; Suidae; Sus. 



OX NCBI_TaxID=9823; 

RN [1] 

RP SEQUENCE FROM N . A. 

RX MEDLINE=9 92 94 62 6; PubMed=103 64 4 56 ; 

RA Panetta R., Guo Y., Magder S., Greenwood M.T.; 

RT "Regulators of G-protein signaling (RGS) 1 and 16 are induced in 

RT response to bacterial lipopolysaccharide and stimulate c-fos promoter 

RT expression. "; 

RL Biochem. Biophys . Res. Commun . 25 9:550-55 6(1999). 

DR EMBL; AF139836; AAD53051.1; -. 

DR HSSP; P49799; 1AGR . 

DR GO; GO: 0004871; F: signal transducer activity; IEA. 

DR InterPro; IPR000342; Regl_Gprotein . 

DR Pfam; PF00615; RGS; 1. 

DR ProDom; PD001580; Regl_Gprotein ; 1. 

DR PROSITE; PS50132; RGS; 1. 

FT NON__TER 1 1 

FT NON_TER 67 67 

SQ SEQUENCE 67 AA; 7650 MW; 7335579D0FC83F83 CRC64; 

Query Match 62.5%; Score 5; DB 6; Length 67; 

Best Local Similarity 100.0%; Pred. No. 2.2e+02; 

Matches 5; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 

Qy 1 EVNLD 5 

I I I I I 

Db 34 EVNLD 3 8 



RESULT 37 
Q9IB55 



ID 


Q9IB55 PRELIMINARY; PRT; 67 AA. 




AC 


Q9IB55; 




DT 


01-OCT-2000 (TrEMBLrel. 15, Created) 




DT 


01-OCT-2000 (TrEMBLrel. 15, Last sequence update) 




DT 


01-JUN-2003 (TrEMBLrel. 24, Last annotation update) 




DE 


Regulators of G protein signaling XRGSV (Fragment) . 




OS 


Xenopus laevis (African clawed frog) . 




OC 


Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; 


Euteleos tomi ; 


OC 


Amphibia; Batrachia; Anura; Mesobatrachia ; Pipoidea 


; Pipidae; 


OC 


Xenopodinae; Xenopus. 




OX 


NCBI TaxID=8355; 




RN 


[1] 




RP 


SEQUENCE FROM N.A. 




RX 


MEDLINE=2 02 00 091; PubMed=l 07 33 901 ; 




RA 


Saitoh 0., Odagiri M. , Masuho I . , Nomoto S., Kinoshita N.; 


RT 


"Molecular cloning and characterization of Xenopus 


RGS 5 . "; 


RL 


Biochem. Biophys. Res. Commun. 270:34-39(2000). 




DR 


EMBL; AB038438; BAA95152.1; -. 




DR 


HSSP; P49799; 1AGR. 




DR 


GO; GO: 0004871; F: signal transducer activity; IEA. 




DR 


InterPro; IPR000342; Regl Gprotein. 




DR 


Pfam; PF00615; RGS; 1 . 




DR 


PRINTS; PR01301; RGSPROTEIN. 




DR 


ProDom; PD001580; Regl Gprotein; 1. 




DR 


PROSITE; PS50132; RGS; 1. 




FT 


NON TER 1 1 





FT NONJIER 67 67 

SQ SEQUENCE 67 AA; 7906 MW; 7 4EF3B8 7 682 6CECE CRC64 ; 

Query Match 62.5%; Score 5; DB 13; Length 67; 

Best Local Similarity 100.0%; Pred. No. 2.2e+02; 

Matches 5; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 

Qy 1 EVNLD 5 

ill!! 

Db 34 EVNLD 38 



RESULT 3 8 
Q9IB58 

ID Q9IB58 PRELIMINARY; PRT; 67 AA. 

AC Q9IB58; 

DT 01-OCT-2000 (TrEMBLrel . 15, Created) 

DT 01-OCT-2000 (TrEMBLrel. 15, Last sequence update) 

DT 01-JUN-2003 (TrEMBLrel. 24, Last annotation update) 

DE Regulators of G protein signaling XRGSII (Fragment). 

OS Xenopus laevis (African clawed frog) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleos tomi ; 

OC Amphibia; Batrachia; Anura; Mesobatrachia ; Pipoidea; Pipidae; 

OC Xenopodinae; Xenopus. 

OX NCBI_TaxID=8355; 

RN [1] 

RP SEQUENCE FROM N.A. 

RX MEDLINE=2 02 00091; PubMed=l 0733901 ; 

RA Saitoh O., Odagiri M. , Masuho I., Nomoto S., Kinoshita N.; 

RT "Molecular cloning and characterization of Xenopus RGS5."; 

RL Biochem. Biophys . Res. Commun. 2 7 0:34-39(2000). 

DR EMBL; AB038435; BAA95149.1; -. 

DR HSSP; P49799; 1AGR . 

DR GO; GO: 0004871; F: signal transducer activity; IEA. 

DR InterPro; IPR000342; Regl_Gprotein . 

DR Pfam; PF00615; RGS ; 1. 

DR PRINTS; PR01301; RGSPROTEIN. 

DR ProDom; PD001580; Regl_Gprotein; 1. 

DR PROSITE; PS50132; RGS; 1. 

FT NON_TER 1 1 

FT NON_TER 67 67 

SQ SEQUENCE 67 AA; 7716 MW; AA0ED8E536609E7B CRC64; 

Query Match 62.5%; Score 5; DB 13; Length 67; 

Best Local Similarity 100.0%; Pred. No. 2.2e+02; 

Matches 5; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 

Qy 1 EVNLD 5 

Mill 

Db 34 EVNLD 38 



RESULT 39 
Q8CN01 

ID Q8CN01 PRELIMINARY; PRT; 68 AA. 

AC Q8CN01; 

DT 01-MAR-2003 (TrEMBLrel. 23, Created) 



DT 01-MAR-2003 (TrEMBLrel . 23, Last sequence update) 

DT Ol-OCT-2003 (TrEMBLrel. 25, Last annotation update) 

DE Mercuric ion-binding protein. 

GN SE2120. 

OS Staphylococcus epidermidis . 

OC Bacteria; Firmicutes; Bacillales ; Staphylococcus. 

OX NCBI_TaxID=1282 ; 

RN [1] 

RP SEQUENCE FROM N . A. 

RC STRAIN=ATCC 12228; 

RA Zhang Y . , Ren S . , Li H., Fu G. , Lu L . , Lu G . , Jia J., Tu Y. , Qin Z., 

RA Chen Z . , Wen Y. ; 

RL Submitted (NOV-2 002) to the EMBL/ GenBank/DDBJ databases. 

DR EMBL; AE016751; AAO05762.1; -. 

DR GO; GO: 0005507; F: copper ion binding; IEA. 

DR GO; GO: 0005375; F: copper ion transporter activity; IEA. 

DR GO; GO: 0046872; F:metal ion binding; IEA. 

DR GO; GO: 0006825; P: copper ion transport; IEA. 

DR GO; GO: 0030001; P:metal ion transport; IEA. 

DR InterPro; IPR000428; Cu_bind. 

DR InterPro; IPR006122; Cu_bind_dom. 

DR InterPro; IPR006121; HeavyMe_transpt . 

DR InterPro; IPR006191; Metal_bind. 

DR Pfam; PF00403; HMA; 1. 

DR PRINTS; PR00944; CUEXPORT . 

DR TIGRFAMs; TIGR00003; TIGR00003; 1. 

DR PROSITE; PS01047; HMA_1 ; 1. 

DR PROSITE; PS50846; HMA_2 ; 1. 

KW Complete proteome . 

SQ SEQUENCE 68 AA; 7661 MW; F2BE2AC69 92E3 6AA CRC64; 

Query Match 62.5%; Score 5; DB 16; Length 68; 

Best Local Similarity 100.0%; Pred. No. 2.2e+02; 

Matches 5; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 

Qy 1 EVNLD 5 

I I I I I 

Db 34 EVNLD 3 8 



RESULT 4 0 
Q8NKU7 

ID Q8NKU7 PRELIMINARY; PRT; 7 0 AA. 

AC Q8NKU7; 

DT 01-OCT-2002 (TrEMBLrel. 22, Created) 

DT 01-OCT-2002 (TrEMBLrel. 22, Last sequence update) 

DT 01-OCT-2002 (TrEMBLrel. 22, Last annotation update) 

DE Membrane-bound NiFe hydrogenase. 

GN HYP1. 

OS Acidianus ambivalens ( Desulf urolobus ambivalens). 

OC Archaea; Crenarchaeota ; Thermoprotei ; Sulf olobales ; Sul f olobaceae ; 

OC Acidianus . 

OX NCBI_TaxID=2283; 

RN [1] 

RP SEQUENCE FROM N.A. 

RC STRAIN=Lei 10; 

RA Laska S., Kletzin A.; 



RT "Biochemical and Molecular analysis of the membrane-bound hydrogenase 

RT of the hyperthermophilic and acidophilic archaeon Acidianus 

RT ambivalens . "; 

RL Submitted (AUG-2001) to the EMBL/ GenBank/DDBJ databases. 

RN [2] 

RP SEQUENCE FROM N . A. 

RC STRAIN=Lei 10; 

RA Laska S . ; 

RL Thesis (2000), Institute of Microbiology and Genetics, Darmstadt. 

DR EMBL; AJ320523; CAC86893.1; -. 

SQ SEQUENCE 70 AA; 8189 MW; CFDDEC93F9FF7A2E CRC64; 



Query Match 62.5%; Score 5; DB 1; Length 70; 

Best Local Similarity 100.0%; Pred. No. 2.3e+02; 

Matches 5; Conservative 0; Mismatches 0; Indels 0; Gaps 

Qy 1 EVNLD 5 

I I I I I 

Db 4 3 EVNLD 47 



Search completed: March 26, 2004, 15:35:45 
Job time : 81 sees 



GenCore version 5.1.6 
Copyright (c) 1993 - 2004 Compugen Ltd. 



OM protein - protein search, using sw model 



Run on : 



Title: 

Perfect score 
Sequence : 



March 26, 2004, 15:30:32 ; Search time 10.5 Seconds 

(without alignments) 
39.672 Million cell updates/sec 

US- 0 9-668-31 4C— 7 1 
8 

1 EVNLDAEF 8 



Scoring table: OLIGO 

Gapop 60.0 , Gapext 60.0 



Searched : 
Word size : 



141681 seqs, 52070155 residues 



0 



Total number of hits satisfying chosen parameter; 

Minimum DB seq length: 0 

Maximum DB seq length: 2000000000 

Post-processing: Listing first 1000 summaries 

Database : SwissProt 42:* 



141681 



Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 



SUMMARIES 



Result 



% 

Query 



o . 


Score 


Match 


Length 


DB 


ID 


Des cription 


1 


7 


87 . 5 


419 


1 


P47K_PSECL 


P31521 


pseudomonas 


2 


6 


75 . 0 


295 


1 


DAP 1_BACHD 


Q9kc32 


bacillus ha 


3 


6 


75. 0 


368 


1 


K1CR XENLA 


P08802 


xenopus lae 


4 


6 


75. 0 


394 


1 


ALR2 BACSU 


P94494 


bacillus su 


5 


6 


75 . 0 


621 


1 


MBHL_ALCHY 


P33374 


alcaligenes 


6 


6 


75 . 0 


4639 


1 


DYHC_DROME 


P37276 


drosophila 


7 


5 


62 . 5 


67 


1 


RGS3_RAT 


P49797 


rattus norv 


8 


5 


62 . 5 


73 


1 


PHD_BPP1 


Q06253 


bacteriopha 


9 


5 


62.5 


90 


1 


EF1B SULSO 


Q64214 


sulf olobus 


10 


5 


62 . 5 


91 


1 


EF1B PYRAB 


Q9v2p6 


pyrococcus 


11 


5 


62 . 5 


91 


1 


EF1B_PYRFU 


Q8tzm5 


pyrococcus 


12 


5 


62 . 5 


91 


1 


EFlB_SULTO 


Q976h8 


sulf olobus 


13 


5 


62 . 5 


92 


1 


KMl 1_TRYBB 


Q26773 


trypanosoma 


14 


5 


62 . 5 


92 


1 


KM11JTRYCR 


Q9u6zl 


trypanosoma 


15 


5 


62 . 5 


119 


1 


ACPS_BACHD 


Q9kfgl 


bacillus ha 


16 


5 


62 . 5 


126 


1 


PFD4__CAEEL 


Q17435 


caenorhabdi 


17 


5 


62 . 5 


159 


1 


GREA CHLTE 


Q8kch5 


chlorobium 



18 


5 


62 


. 5 


172 


1 


YFIR_ECOLI 


P7 6597 


^ c l~i r"i ^ 


19 


5 


62 


. 5 


184 


1 


KAD 1_ANAS P 


08vDi 8 


x 1 ci ^ x i- * — ' K--" 


20 


5 


62 


. 5 


185 


1 


TBP METKA 


08tx38 


TT1 h A Ti OTJ Vfl] 


21 


5 


62 


. 5 


197 


1 


OM2 6 HAEIN 


057483 


h a PTnnrih 1 "1 1 1 <=■ 

lid dlLl^ Lji i 1 J 1_ d >_> 


22 


5 


62 


. 5 


197 


1 


YDB6 YEAST 


012055 


55 Fi C cYl Pi KOTTlVr* 

*J CL V — ^ V < i 1 GL J W ILL V 


23 


5 


62 


. 5 


202 


1 


GDIR YEAST 


012434 


o ci v_- v^*i i ci i— Vh/ilv y v -~- 


24 


5 


62 


. 5 


205 


1 


RGS4 HUMAN 


P4 9798 


hrnnn q ^ n i fti 

IIViv/ILL^^ C^L —1— ^11 


25 


5 


62 


. 5 


205 


1 


RGS4 MOUSE 


008899 


Trill q mi 1 <^ C 11 1 u 

1 Uui O ILL L_4. p^> V** L*»l -1- v^l 


26 


5 


62 


. 5 


205 


1 


RGS4 RAT 


P4 9799 


-L. Cl <w L_ l^X ^ 1 1 v> -l_ V 


27 


5 


62 


. 5 


217 


1 


NFNB ECOLI 


P38489 


^ c; Vi <=i r" "i <""Vi"i ^ 
^— o ' — ale J Lt il-Ld 


28 


5 


62 


. 5 


217 


1 


NFNB SALTY 


P158 8 8 


c ^ 1 mpiri (2* 1 "I ^ 

O d _1_ ±1 ll_J 1 1 C J L a. 


29 


5 


62 


.5 


225 


1 


MINC PASMU 


P57845 


QO LC i-1 1_ VZT _l l_ CI 


30 


5 


62 


. 5 


229 


1 


Y127 TREPA 


083164 


1 - T" (""vn y> 
J_ ctij v>l l Let 


31 


5 


62 


. 5 


231 


1 


CTRA CAUCR 


04 S994 




32 


5 


62 


. 5 


234 


1 


RL1 HELP J 


09 -7k2 1 




33 


5 


62 


. 5 


235 


1 


RGSI HUMAN 


09ns2 8 


ii^— 'iin_J oauicn 


34 


5 


62 , 


. 5 


239 


1 


NPD RHOBA 


07ufa9 


1 11U l— i. vJ LJ J_ i_ ™ J L 


35 


5 


62, 


. 5 


250 


1 


Y514 METJA 


057934 


mf 1 'l"1^3'no^ , o^ , ^~ , 


36 


5 


62 , 


. 5 


251 


1 


E1A ADE41 


PI 0 S42 


li Hi LLci 1 1 dUCHU 


37 


5 


62 . 


. 5 


277 


1 


HEMK BUCBP 


OR 9^ t 0 


JJ UL-llIlci. a. c4.£J 


38 


5 


62 . 


. 5 


283 


1 


KPRS MET AC 


08 t rk8 


liter L-iicill^ o d. J_ o 


39 


5 


62 , 


. 5 


290 


1 


G3P LACDT 


P5507 0 


1 A i - a i — i lie; W 


40 


5 


62. 


. 5 


291 


1 


G3P BOLED 


000301 


Viol ^ "t - 1 T C! pHi 1 


41 


5 


62 . 


. 5 


304 


1 


G3P2 DROPS 


\-J 1 *± X w 4 


UlCJb CJ^JIl J_ _L ci 


42 


5 


62 . 


, 5 


304 


1 


G3P2 DROSU 


04 4105 


Hrocsorih i 1 p* 
j_ w o uu 1 1 J La 


43 


5 


62 . 


. 5 


305 


1 


Y031 STRGR 


P362 61 

i- KJ iL. \J _L 


O L-LCp L (JILL V " 


44 


5 


62 . 


, 5 


318 


1 


KDGT XANAC 




ACUl L- 1 1 t_Jl L iu lido 


45 


5 


62. 


. 5 


330 


1 


VP 19 HSVSA 


001051 


11 C 4_ ^ ~ O V _L J_ LI O 


46 


5 


62 . 


5 


332 


1 


G3P1 DROME 


pf)74Rfi 

£ W / 1 V \J 


UlUoupillXa 


47 


5 


62 . 


5 


332 


1 


G3P2 DROME 


P07487 


c\ r AQnrih "i "1 p» 

V-t-l- w O w LJ 11 J L CI 


48 


5 


62. 


5 


332 


1 


G3P DROHY 


001597 


Hrn^nnh i "1 ^ 

-L W O V^V 1 1 J L CI 


49 


5 


62 . 


5 


334 


1 


YMOl RHIME 

-*-r A — ■ — J- *--K X mAmr Ar -At 


P4 93 0 5 


i 7rihi l im m 

J-ll-L. t-i\Jb*J-L- L11LL ILL 


50 


5 


62 . 


5 


337 


1 


G3P OMPOL 


VJ L- L J ^ 


nmnha 1 nt*n q 

l i£s 1 1 Ct J_ U L UiJ 


51 


5 


62 . 


5 


337 


1 


G3P PHACH 


W J. J U 


^Jl lcl 1 1 1; X. Ol let t: 


52 


5 


62 . 


5 


337 


1 


G3P USTMA 


P093 1 7 


1 1 c 1 — ! "1 

L_i <-J l—J LaVJ u 11 LCI 


53 


5 


62 . 


5 


337 


1 


MAT 2 COCSA 


09r>4 4 S 


pnrh 1 i nHn 1 ii 

v_^v^> V^ll J. _i_ y^/i^J W _l_ LI 


54 


5 


62 . 


5 
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P26469 salmonella 
Q03544 bacteriopha 
06772 8 aquif ex aeo 
03237 8 methyl obaci 
P337 4 6 Clostridium 
P29509 saccharomyc 
P43788 haemophilias 
Q9kss4 vibrio chol 
028277 archaeoglob 
P54941 bacillus su 
P47541 mycoplasma 
P75359 mycoplasma 
Q9a3g2 caulobacter 
P39916 coxiella bu 



ALIGNMENTS 



RESULT 1 
P47K_PSECL 

ID P47K_PSECL STANDARD; PRT ; 419 AA. 

AC P31521; 

DT 01-JUL-1993 (Rel. 26, Created) 
DT 01-JUL-1993 (Rel. 26, Last sequence update) 
DT 16-OCT-2001 (Rel. 40, Last annotation update) 
DE 47 kDa protein (P47K) . 

OS Pseudomonas chlororaphis (Pseudomonas aureof aciens ) . 

OC Bacteria; Proteobacteria ; Gammaproteobacteria ; Pseudomonadales ; 

OC Pseudomonadaceae; Pseudomonas. 

OX NCBI_TaxID=333; 

RN [1] 

RP SEQUENCE FROM N.A. 
RC STRAIN=B2 3; 

RX MEDLINE=91193202; PubMed-2 013568 ; 

RA Nishiyama M. , Horinouchi S., Kobayashi M. , Nagasawa T., Yamada H., 
RA Beppu T . ; 

"Cloning and characterization of genes responsible for metabolism of 
nitrile compounds from Pseudomonas chlororaphis B23."; 
RL J. Bacterid. 173:24 65-2 4 72(1991). 

CC FUNCTION: THE PRESENCE OF P47K IS CRITICAL FOR THE EXPRESSION 

CC OF THE NITRILE HYDRATASE GENES. MAY STABILIZE OR ACTIVATE THE 

CC NITRILE HYDRATASE PROTEINS. 



RT 
RT 



CC 



CC This SWTSS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinf ormatics and the EMBL outstation - 

CC the European Bioinf ormatics Institute. There are no restrictions on its 

CC use by non-profit institutions as long as its content is in no way 

CC modified and this statement is not removed. Usage by and for commercial 

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/ 

CC or send an email to license@isb-sib . ch ) . 

CC 

DR EMBL; D90216; BAA14247.1; 

DR PIR; D42725; D42725. 

DR InterPro; IPR003495; CobW . 

DR Pfam; PF02492; cobW; 1. 

SQ SEQUENCE 419 AA; 46666 MW; FF5113800E27FF0C CRC64; 



Query Match 87.5%; Score 7; DB 1; Length 419; 

Best Local Similarity 100.0%; Pred. No. 1; 

Matches 7 ; Conservative 0; Mismatches 0; Indels 0 ; Gaps 0; 

Qy 1 EVNLDAE 7 

I ! I I I I I 
Db 4 8 EVNLDAE 54 



RESULT 2 
DAP 1_BACH D 

ID DAP1__BACHD STANDARD; PRT; 2 95 AA. 

AC Q9KC32; 

DT 16-OCT-2001 (Rel. 40, Created) 

DT 16-OCT-2001 (Rel. 40, Last sequence update) 

DT 28-FEB-2003 (Rel. 41, Last annotation update) 

DE Dihydrodipicolinate synthase 1 (EC 4.2.1.52) (DHDPS 1). 

GN DAPA1 OR BH1742. 

OS Bacillus halodurans . 

OC Bacteria; Firmicutes; Bacillales; Bacillaceae; Bacillus. 

OX NCBI_TaxID=86665; 

RN [1] 

RP SEQUENCE FROM N.A. 

RC STRAIN=C-125 / JCM 9153; 

RX MEDLINE=20512582 ; PubMed=11058 132 ; 

RA Takami H., Nakasone K., Takaki Y., Maeno G., Sasaki R. , Masui N-, 

RA Fuji F., Hirama C, Nakamura Y. , Ogasawara N . , Kuhara S., 

RA Horikoshi K. ; 

RT "Complete genome sequence of the alkaliphilic bacterium Bacillus 

RT halodurans and genomic sequence comparison with Bacillus subtilis."; 

RL Nucleic Acids Res. 2 8:4 317-4331(2 000). 

CC -!- CATALYTIC ACTIVITY: L-aspartate 4-semialdehyde + pyruvate = 
CC dihydrodipicolinate + 2 H(2)0. 

CC -!- PATHWAY: Biosynthesis of diaminopimelate and lysine from aspartate 

CC semi aldehyde; first step. 

CC -!- SUBUNIT: Homotetramer (By similarity). 

CC -!- SUBCELLULAR LOCATION: Cytoplasmic (By similarity). 

CC SIMILARITY: Belongs to the DHDPS family. 

CC 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinf ormatics and the EMBL outstation - 

CC the European Bioinf ormatics Institute. There are no restrictions on its 

CC use by non-profit institutions as long as its content is in no way 

CC modified and this statement is not removed. Usage by and for commercial 

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/ 

CC or send an email to license@isb-sib . ch) . 

CC 

DR EMBL; AP001513; BAB05461.1; -. 

DR PIR; F83867; F83867. 

DR HSSP; P05640; 1DHP . 

DR HAMAP; MF_00418; -; 1. 

DR InterPro; IPR005263; DapA. 

DR InterPro; IPR002220; DHDPS. 

DR Pf am; PF00701 ; DHDPS ; 1 . 

DR PRINTS; PR00146; DHPICSNTHASE . 

DR ProDom; PD001859; DHDPS; 1. 



DR TIGRFAMs; TIGR00674; dapA; 1. 

DR PROSITE; PS00665; DHDPS_1; 1. 

DR PROSITE; PS00666; DHDPS_2 ; 1. 

KW Lyase; Diaminopimelate biosynthesis; Lysine biosynthesis; 

KW Complete proteome. 

FT ACT_SITE 162 162 BY SIMILARITY. 

SQ SEQUENCE 295 AA; 31643 MW; 5 6D3567 07 6CD8FB5 CRC64 ; 

Query Match 75.0%; Score 6; DB 1; Length 295; 

Best Local Similarity 100.0%; Pred. No. 9; 

Matches 6; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 

Qy 2 VNLDAE 7 

I I I I I I 

Db 142 VNLDAE 147 



RESULT 3 
K1CR_XENLA 

ID K1CR_XENLA STANDARD; PRT ; 368 AA. 

AC P08802; 

DT 01-NOV-1988 (Rel. 09, Created) 

DT 01-NOV-1988 (Rel. 09, Last sequence update) 

DT 28-FEB-2003 (Rel. 41, Last annotation update) 

DE Keratin, type I cytoskeletal endo B (Fragment) . 

GN XK ENDO B. 

OS Xenopus laevis (African clawed frog) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Amphibia; Batrachia; Anura; Mesobat rachia ; Pipoidea; Pipidae; 

OC Xenopodinae; Xenopus. 

OX NCBI_TaxID=8355; 

RN [1] 

RP SEQUENCE FROM N . A. 

RX MEDLINE=8 9 0 92 007; PubMed=2 463213; 

RA Laflamme S.E., Jamrich M. , Richter K., Sargent T.D., Dawid I.B.; 

RT "Xenopus endo B is a keratin preferentially expressed in the 

RT embryonic notochord . " ; 

RL Genes Dev. 2:853-862(1988). 

CC -!- SUBUNIT: Heterotetramer of two type I and two type II keratins. 

CC -f- MISCELLANEOUS: There are two types of cytoskeletal and 
CC microfibrillar keratin, I (acidic) and II (neutral to basic) (40- 

CC 55 and 56-70 kDa, respectively) . 

CC -!- SIMILARITY: Belongs to the intermediate filament family. 

CC : 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinf ormatics and the EMBL outstation - 

CC the European Bioinf ormatics Institute. There are no restrictions on its 

CC use by non-profit institutions as long as its content is in no way 

CC modified and this statement is not removed. Usage by and for commercial 

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/ 

CC or send an email to license@isb-sib . ch ) . 

CC 

DR EMBL; Y00230; CAA68372.1; -. 

DR PIR; A28825; A28825. 

DR InterPro; IPR001664; IF. 

DR InterPro; IPR002957; Keratin_I . 

DR Pfam; PF00038; filament; 1. 



DR 


PRINTS ; 


PR0124 8; TYPE1KERATIN . 


DR 


PROSITE; 


PS00226; 


IF; 1 . 






KW 


Intermediate filament; Coiled coil; Keratin. 


FT 


NON TER 


1 


1 






FT 


DOMAIN 


<1 


15 




HEAD. 


r 1 


DOMAIN 


16 


325 




ROD. 


FT 


DOMAIN 


326 


368 




TAIL. 


FT 


DOMAIN 


16 


54 




COIL 1A. 


FT 


DOMAIN 


55 


66 




LINKER 1. 


FT 


DOMAIN 


67 


162 




COIL IB. 


FT 


DOMAIN 


163 


183 




LINKER 12. 


FT 


DOMAIN 


184 


325 




COIL 2. 


SQ 


SEQUENCE 


368 AA; 


41683 MW; 2 9FC1 9B67E9E8 


Query Match 




75. 


0%; 


Score 6; DB 1 ; 


Best Local 


Similarity 


100 


. 0%; 


Pred. No. 11; 


Matches 


6; Conservative 




0; Mismatches 


Qy 


2 


VNLDAE 7 









Length 368; 



0; Indels 



0; Gaps 



0; 



Db 



I I I I I I 
22 0 VNLDAE 225 



RESULT 4 
ALR2_BACSU 

ID ALR2_BACSU STANDARD; PRT; 394 AA. 

AC P94494; 

DT 15-DEC-1998 (Rel. 37, Created) 

DT 15-DEC-1998 (Rel. 37, Last sequence update) 

DT 10-OCT-2003 (Rel. 42, Last annotation update) 

DE Alanine racemase 2 (EC 5.1.1.1). 

GN ALR2 OR BSU17640. 

OS Bacillus subtilis . 

OC Bacteria; Firmicutes; Bacillales; Bacillaceae; Bacillus. 

OX NCBI_TaxID=142 3; 

RN [1] 

RP SEQUENCE FROM N . A. 

RA Borchert S., Klein C . , Piksa B., Hammelmann M. , Entian K.D.; 

RT "Sequencing of a 26 kb region of the Bacillus subtilis genome 

RT downstream of spoVJ. "; 

RL Submitted (FEB-1997) to the EMBL/ GenBank/DDBJ databases. 

RN [2] 

RP SEQUENCE FROM N.A. 

RC STRAIN=168; 

RX MEDLINE=9 8 04 4 033 ; PubMed=9 3 8 4 377; 

RA Kunst F., Ogasawara N., Moszer I., Albertini A.M., Alloni G., 

RA Azevedo V., Bertero M.G., Bessieres P., Bolotin A., Borchert S., 

RA Borriss R., Boursier L., Brans A., Braun M. , Brignell S.C., Bron S., 

RA Brouillet S., Bruschi C.V., Caldwell B., Capuano V., Carter N.M., 

RA Choi S.K., Codani J. J., Connerton I.F., Cummings N.J., Daniel R.A., 

RA Denizot F. , Devine K.M., Dusterhoft A., Ehrlich S.D., Emmerson P.T., 

RA Entian K.D., Errington J., Fabret C, Ferrari E., Foulger D., 

RA Fritz C, Fujita M. , Fujita Y., Fuma S., Galizzi A., Galleron N., 

RA Ghim S.Y., Glaser P., Goffeau A., Golightly E.J., Grandi G . , 

RA Guiseppi G . , Guy B.J., Haga K., Haiech J., Harwood C.R., Henaut A., 

RA Hilbert H., Holsappel S., Hosono S., Hullo M.F., Itaya M. , Jones L., 

RA Joris B . , Karamata D., Kasahara Y. , Klaerr-Blanchard M. , Klein C, 



RA Kobayashi Y. , Koetter P., Koningstein G., Krogh S., Kumano M. r 

RA Kurita K. , Lapidus A., Lardinois S., Lauber J., Lazarevic V. , 

RA Lee S.M., Levine A. , Liu H., Masuda S., Mauel C, Medigue C, 

RA Medina N., Mellado R.P., Mizuno M. , Moestl D., Nakai s., Noback M. , 

RA Noone D., O'Reilly M. , Ogawa K., Ogiwara A., Oudega B., Park S.H.,' 

RA Parro V. f Pohl T.M., Portetelle D . r Porwollik S., Prescott A.M., 

RA Presecan E., Pujic P., Purnelle B., Rapoport G., Rey M. , Reynolds S., 

RA Rieger M. , Rivolta C, Rocha E . , Roche B., Rose M. , Sadaie Y., 

RA Sato T., Scanlan E., Schleich S., Schroeter R. f Scoffone F., 

RA Sekiguchi J., Sekowska A., Seror S.J., Serror P . , Shin B.S., Soldo B., 

RA Sorokin A., Tacconi E . , Takagi T . , Takahashi H., Takemaru K. , 

RA Takeuchi M. r Tamakoshi A., Tanaka T. f Terpstra P., Tognoni A., 

RA Tosato V., Uchiyama S., Vandenbol M. , Vannier F. f Vassarotti A., 

RA Viari A., Wambutt R. , Wedler E . , Wedler H., Weitzenegger T . , 

RA Winters P., Wipat A., Yamamoto H., Yamane K. , Yasumoto K., Yata K. , 

RA Yoshida K. , Yoshikawa H.F., Zumstein E. , Yoshikawa H. , Danchin A. ; 

RT "The complete genome sequence of the Gram-positive bacterium Bacillus 

RT subtilis."; 

RL Nature 390:249-256(1997). 

CC FUNCTION: Provides the D-alanine required for cell wall 

CC biosynthesis (By similarity) . 

CC -!- CATALYTIC ACTIVITY: L-alanine = D-alanine. 

CC -!- COFACTOR: Pyridoxal phosphate (By similarity). 

CC -!- PATHWAY: D-alanine branch of peptidoglycan biosynthesis; first 
CC step. 

CC -!- SIMILARITY: Belongs to the alanine racemase family. 

CC 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinf ormatics and the EMBL outstation - 

CC the European Bioinf ormatics Institute. There are no restrictions on its 

CC use by non-profit institutions as long as its content is in no way 

CC modified and this statement is not removed. Usage by and for commercial 

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/ 

CC or send an email to license@isb-sib . ch) . 

CC 

DR EMBL; U66480; AAB41097.1; -. 

DR EMBL; Z99113; CAB13648.1; -. 

DR PIR; F69888; F69888. 

DR HSSP; P10724; 1BD0 . 

DR SubtiList; BG12267; alr2. 

DR HAMAP; MF_01201; -; 1 . 

DR InterPro; IPR000821; Ala_racemase . 

DR InterPro; IPR009006; Racem_decarbox_C . 

DR InterPro; IPR001608; UPF0001. 

DR Pfam; PF00842; Ala_racemase_C; 1. 

DR Pfam; PF01168; Ala_racemase_N; 1. 

DR PRINTS; PR00992; ALARACEMASE. 

DR TIGRFAMs; TIGR00492; air; 1. 

DR PROSITE; PS00395; ALAN I NE_ RACEMASE ; 1. 

KW Isomerase; Pyridoxal phosphate; Cell wall; Peptidoglycan synthesis; 

KW Complete proteome. 

FT ACT_SITE 39 39 CATALYTIC BASE SPECIFIC TO D-ALANINE (BY 

FT SIMILARITY) . 

FT ACT^SITE 272 272 CATALYTIC BASE SPECIFIC TO L -ALANINE (BY 

FT SIMILARITY) . 

FT BINDING 39 39 PYRIDOXAL PHOSPHATE (BY SIMILARITY) . 

SQ SEQUENCE 394 AA; 43648 MW; FE446C21107FBA6E CRC64 ; 



Query Match 75.0%; Score 6; DB 1; Length 394; 

Best Local Similarity 100.0%; Pred. No. 11; 

Matches 6; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 

Qy 1 EVNLDA 6 

I I I I I I 

Db 11 EVNLDA 16 



RESULT 5 
MBHL_ALCHY 

ID MBHL_ALCHY STANDARD; PRT; 621 AA. 

AC P33374; 

DT 01-FEB-1994 (Rel. 28, Created) 

DT 01-FEB-1996 (Rel. 33, Last sequence update) 

DT 10-OCT-2003 (Rel. 42, Last annotation update) 

DE Uptake hydrogenase large subunit (EC 1.12.99.6) (Hydrogenlyase ) 
DE (Membrane-bound hydrogenase large subunit) . 

GN HUPL. 

OS Alcaligenes hydrogenophilus . 

OC Bacteria; Proteobacteria ; Betaproteobacteria; Burkholderiales ; 

OC Alcaligenaceae; Alcaligenes. 

OX NCBI_TaxID=516; 
RN [1] 

RP SEQUENCE FROM N. A. 

RX MEDLINE-93193199; PubMed=12 94332 ; 

RA Yagi K. , Seto T., Terakado M. , Umeda F. , Doi T . , Imanishi T., 

RA Miura Y. ; 

RT "Nucleotide sequences of membrane-bound hydrogenase gene in 

RT Alcaligenes hydrogenophilus. 1 '; 

RL Chem. Pharm. Bull. 4 0:3292-3296(1992). 

CC -!- FUNCTION: This enzyme recycles the H(2) produced by nitrogenase to 
cc increase the production of ATP and to protect nitrogenase against 

cc inhibition or damage by 0(2) under carbon- or phosphate-limited 

CC conditions . 

- CATALYTIC ACTIVITY: H(2) + acceptor = 2 H(+) 4- reduced acceptor. 

- COFACTOR: Binds 1 nickel ion per subunit (By similarity) . 

- SUBUNIT: Heterodimer of a large and a small subunit. 

- SUBCELLULAR LOCATION: Membrane-bound. 

- SIMILARITY: Belongs to the [NiFej / [NiFeSe] hydrogenase large 
CC subunit family . 

CC 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinf ormatics and the EMBL outstation - 

CC the European Bioinf ormatics Institute. There are no restrictions on its 

CC use by non-profit institutions as long as its content is in no way 

CC modified and this statement is not removed. Usage by and for commercial 

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/ 

CC or send an email to license@isb-sib . ch ) . 

CC 

DR EMBL; S56898; AAB25780.1; ALT_SEQ. 

DR PIR; JH0776; JH0776. 

DR HSSP; P21852; 1H2R. 

DR InterPro; IPR001501; Ni_hdL. 

DR Pfam; PF00374; NiFeSe_Hases ; 1. 

DR PROSITE; PS00507; NI HGENASE L 1; FALSE NEG. 



CC 
CC 
CC 
CC 
CC 



DR 
KW 



PROSITE; PS00508; NI_HGENASE_L_2 ; 1. 
Oxidoreductase; Membrane; Metal-binding; Nickel 



FT 


METAL 


75 


75 


NICKEL (POTENTIAL) . 


FT 


METAL 


78 


78 


NICKEL (POTENTIAL) . 


FT 


METAL 


600 


600 


NICKEL (POTENTIAL) . 


FT 


METAL 


603 


603 


NICKEL (POTENTIAL) . 


SQ 


SEQUENCE 


621 AA; 


68924 


MW; 906AC4414285D4A3 CRC64 ; 



Query Match 75.0%; Score 6; DB 1; Length 621; 

Best Local Similarity 100.0%; Pred. No. 16; 

Matches 6; Conservative 0; Mismatches 0; Indels 

Qy 1 EVNLDA 6 

I ! I I I I 

Db 33 EVNLDA 38 



0; Gaps 



0; 



RESULT 6 
DYHC_DROME 

ID DYHC_DROME STANDARD; PRT; 4 63 9 AA. 

AC P37276; 

DT 01-OCT-1994 (Rel. 30, Created) 

DT 01-OCT-1994 (Rel. 30, Last sequence update) 

DT 10-OCT-2003 (Rel. 42, Last annotation update) 

DE Dynein heavy chain, cytosolic (DYHC) . 

GN CDHC OR DHC64C. 

OS Drosophila melanogaster (Fruit fly) . 

OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; 

OC Neoptera; Endopterygota ; Diptera; Brachycera; Mus comorpha ; 

OC Ephydroidea; Drosophilidae ; Drosophila. 

OX NCBI_TaxID=7227 ; 

RN [1] 

RP SEQUENCE FROM N.A. 

RX MEDLINE=94375524; PubMed=8 089 1 8 0 ; 

RA Li M., McGrail M. , Serr M. , Hays T.S.; 

RT "Drosophila cytoplasmic dynein, a microtubule motor that is 

RT asymmetrically localized in the oocyte."; 

RL J. Cell Biol. 12 6:14 75-14 94(1994). 

RN [2] 

RP SEQUENCE OF 1877-1998 FROM N.A. 

RX MEDLINE=94243034 ; PubMed=8 18 64 64 ; 

RA Rasmusson K., Serr M. , Gepner J., Gibbons I., Hays T.S.; 

RT "A family of dynein genes in Drosophila melanogaster."; 

RL Mol. Biol. Cell 5:45-55(1994). 

CC -!- FUNCTION: Cytoplasmic dynein acts as a motor for the intracellular 
cc retrograde motility of vesicles and organelles along microtubules. 

Dynein has ATPase activity; the force-producing power stroke is 
thought to occur on release of ADP. 

SUBUNIT: Consists of at least two heavy chains and a number of 
CC intermediate and light chains . 

CC -!- SUBCELLULAR LOCATION: Cytoplasmic. 

CC -!- DOMAIN: Dynein heavy chains probably consist of an N-terminal stem 
CC (which binds cargo and interacts with other dynein components), 

CC and the head or motor domain. The motor contains six tandemly- 

cc linked AAA domains in the head, which form a ring. A stalk-like 

cc structure (formed by two of the coiled coil domains) protrudes 

CC between AAA 4 and AAA 5 and terminates in a microtubule-binding 



CC 
CC 

CC - I - 



cc 
cc 
cc 
cc 
cc 
cc 
cc 
cc 
cc 
cc 
cc 
cc 
cc 
cc 
cc 
cc 

DR 
DR 
DR 
DR 



- i - 



site. A seventh domain may also contribute to this ring; it is not 
clear whether the N-terminus or the C-terminus forms this extra 
domain. There are four well-conserved and two non-conserved ATPase 
sites, one per AAA domain. Probably only one of these (within AAA 
1) actually hydrolyzes ATP, the others may serve a regulatory 
function . 

SIMILARITY: Belongs to the dynein heavy chain family. 



This SWISS-PROT entry is copyright. It is produced through a collaboration 
between the Swiss Institute of Bioinf ormatics and the EMBL outstation - 
the European Bioinf ormatics Institute. There are no restrictions on its 
use by non-profit institutions as long as its content is in no way 
modified and this statement is not removed. Usage by and for commercial 
entities requires a license agreement (See http://www.isb-sib.ch/announce/ 
or send an email to license@isb-sib . ch) . 



DR 
DR 
DR 
DR 
KW 



EMBL; L23195; AAA60323.1; -. 
EMBL; L25122; AAA28492.1; 
PIR; A54794; A54794. 
FlyBase; FBgn0010349; Dhc64C 



DR 


GO; 


GO: 


0045169; 


C: 


f usome; NAS . 


DR 


GO; 


GO: 


0005875; 


C: 


microtubule associated complex; IDA. 


DR 


GO; 


GO: 


0008567; 


F: 


dynein ATPase activity; IDA. 


DR 


GO; 


GO: 


0003777; 


F: 


microtubule motor activity; IDA. 


DR 


GO; 


GO: 


0007098; 


P: 


centrosome cycle; IMP. 


DR 


GO; 


GO: 


0007292; 


P : 


female gamete generation; IMP. 


DR 


GO; 


GO: 


0045478 ; 


P : 


fusome organization and biogenesis; NAS. 


DR 


GO; 


GO: 


0007018; 


P: 


microtubule-based movement; IDA. 


DR 


GO; 


GO: 


0007294; 


P : 


oocyte cell fate determination (sensu Insecta) ; IMP. 



InterPro; IPR003593; AAA_ATPase. 
Inter Pro; IPR0 042 73; Dynein_heavy . 
Pfam; PF03028; Dynein_heavy ; 1. 
SMART; SM00382; AAA; 4. 
Motor protein; Dynein; Microtubule; 



ATP-binding; Repeat; Coiled coil 



FT 


DOMAIN 


1 


1856 


STEM (BY SIMILARITY) . 


FT 


DOMAIN 


1857 


2084 


AAA 1 


(BY SIMILARITY) . 


FT 


DOMAIN 


2166 


2437 


AAA 2 


(BY SIMILARITY) . 


FT 


DOMAIN 


2541 


2790 


AAA 3 


(BY SIMILARITY) . 


FT 


DOMAIN 


2884 


3153 


AAA 4 


(BY SIMILARITY) . 


FT 


DOMAIN 


3189 


3478 


STALK 


(BY SIMILARITY) . 


FT 


DOMAIN 


3539 


3768 


AAA 5 


(BY SIMILARITY) . 


FT 


DOMAIN 


3989 


4205 


AAA 6 


(BY SIMILARITY) . 


FT 


DOMAIN 


530 


565 


COILED 


COIL (POTENTIAL) . 


FT 


DOMAIN 


774 


794 


COILED 


COIL (POTENTIAL) . 


FT 


DOMAIN 


1264 


1368 


COILED 


COIL (POTENTIAL) . 


FT 


DOMAIN 


3189 


3261 


COILED 


COIL (POTENTIAL) . 


FT 


DOMAIN 


3382 


3478 


COILED 


COIL (POTENTIAL) . 


FT 


DOMAIN 


3723 


3782 


COILED 


COIL (POTENTIAL) . 


FT 


NP_BIND 


1895 


1902 


ATP (POTENTIAL) . 


FT 


NP_BIND 


2210 


2217 


ATP (POTENTIAL) . 


FT 


NP_BIND 


2580 


2587 


ATP ( POTENTIAL ) . 


FT 


NP_BIND 


2922 


2929 


ATP (POTENTIAL) . 


SQ 


SEQUENCE 


4639 


AA; 530152 


MW; 057A7D8800CCD07E CRC 



Query Match 75.0%; Score 6; DB 1; 

Best Local Similarity 100.0%; Pred. No. 85; 
Matches 6; Conservative 0; Mismatches 



Length 4639; 
0; Indels 



0; Gaps 



0; 



Qy 3 NLDAEF 8 

Mill! 

Db 3885 NLDAEF 3890 



RESULT 7 
RGS3_RAT 

ID RGS3_RAT STANDARD; PRT; 67 AA. 

AC P49797; 

DT 01-OCT-1996 (Rel. 34, Created) 

DT 01-OCT-1996 (Rel. 34, Last sequence update) 

DT 28-FEB-2003 (Rel. 41, Last annotation update) 

DE Regulator of G-protein signaling 3 (RGS3) (Fragment) . 

GN RGS 3. 

OS Rattus norvegicus (Rat) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleos tomi ; 

OC Mammalia; Eutheria; Rodentia; Sciurognathi ; Muridae; Murinae; Rattus. 

OX NCBI_TaxID=10116; 
RN [1] 

RP SEQUENCE FROM N . A. 

RC TISSUE=Brain; 

RX MEDLINE=9 614 0 64 5; PubMed-8 54 8 8 15 ; 

RA Koelle M.R., Horvitz H.R.; 

RT "EGL-10 regulates G protein signaling in the C. elegans nervous 

RT system and shares a conserved domain with many mammalian proteins."; 

RL Cell 84:115-125(1996). 

RN [2] 

RP PHOSPHORYLATION. 

RX MEDLINE=20167219; PubMed=107 023 09 ; 

RA Pedram A. , Razandi M. , Kehrl J., Levin E.R.; 

"Natriuretic peptides inhibit G protein activation. Mediation through 
cross-talk between cyclic GMP-dependent protein kinase and regulators 
of G protein-signaling proteins. "; 

RL J. Biol. Chem. 27 5:73 65-7372(2 000). 

CC -!- FUNCTION: Inhibits signal transduction by increasing the GTPase 
cc activity of G protein alpha subunits thereby driving them into 

CC their inactive GDP-bound form. 

CC -!- PTM: Phosphorylated by cyclic GMP-dependent protein kinase. 

CC -!- SIMILARITY: Contains 1 RGS domain. 

CC 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinf ormatics and the EMBL outstation - 

CC the European Bioinf ormatics Institute. There are no restrictions on its 
use by non-profit institutions as long as its content is in no way 
modified and this statement is not removed. Usage by and for commercial 

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/ 

CC or send an email to license@isb-sib . ch) . 

CC 

DR EMBL; U32434; AAC52371.1; 

DR HSSP; P49799; 1AGR. 

DR InterPro; IPR000342; Regl_Gprotein . 

DR Pfam; PF00615; RGS; 1. 

DR PRINTS; PR01301; RGSPROTEIN . 

DR PROSITE; PS50132; RGS; 1. 

KW Signal transduction inhibitor; Phosphorylation. 

FT NON TER 1 1 



RT 
RT 
RT 



CC 
CC 



FT DOMAIN <1 >67 RGS . 

FT NON_TER 67 67 

SQ SEQUENCE 67 AA; 7641 MW; 0 6FE7 630E1CE3AF5 CRC64; 

Query Match 62.5%; Score 5; DB 1; Length 67; 

Best Local Similarity 100.0%; Pred. No. 32; 

Matches 5 ; Conservative 0 ; Mismatches 0 ; Indels 0 ; Gaps 0 ; 

Qy 1 EVNLD 5 

EMM 

Db 34 EVNLD 38 



DT 
DT 



RESULT 8 
PHD_BPP1 

ID PHD^BPPl STANDARD; PRT; 73 AA. 

AC Q06253; 

DT 01-FEB-1995 (Rel. 31, Created) 

01-FEB-1995 (Rel. 31, Last sequence update) 
15-MAR-2004 (Rel. 43, Last annotation update) 
DE Prevent host death protein. 

GN PHD . 

OS Bacteriophage PI. 

OC Viruses; dsDNA viruses, no RNA stage; Caudovirales ; Myoviridae; 

OC Pl-like viruses . 

OX NCBI_TaxID=10678; 
RN [1] 

RP SEQUENCE FROM N.A. 

RX MEDLINE=94 016561; PubMed=8 4 1 1 1 5 3 ; 

RA Lehnherr H., Maguin E . , Jafri S., Yarmolinsky M.B.; 

RT "Plasmid addiction genes of bacteriophage PI: doc, which causes cell 

RT death on curing of prophage, and phd, which prevents host death when 

RT prophage is retained."; 

RL J. Mol. Biol. 233:414-428(1993). 

CC -!- FUNCTION: Antidote to the toxic effect of the doc protein on 
cc the host. Phd and doc proteins function in unisson to stabilize 

cc plasmid number by inducing a lethal response to plasmid loss. 

CC -!- MISCELLANEOUS: The concentration of phd in PI lysogens is far 

greater than that of the poison it antagonizes. Such an excess may 
assume the well-being of carriers of the addicting plasmid. 
CC SIMILARITY: BELONGS TO THE PHD/YEFM FAMILY. 

CC 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinf ormatics and the EMBL outstation - 

CC the European Bioinf ormatics Institute. There are no restrictions on its 

CC use by non-profit institutions as long as its content is in no way 

CC modified and this statement is not removed. Usage by and for commercial 

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/ 

CC or send an email to license@isb-sib . ch) . 

CC 

DR EMBL; M95666; AAA16932.1; 

DR PIR; S40015; S40015. 

DR InterPro; IPR006442; Phd_fam. 

DR TIGRFAMs ; TIGR01552; phd_fam; 1. 

FT DOMAIN 57 73 NON-ESSENTIAL FOR ACTIVITY. 

SQ SEQUENCE 73 AA; 8133 MW; 5FDB9D35654 4 0 050 CRC64; 



CC 
CC 



Query Match 62.5%; Score 5; DB 1; Length 73; 

Best Local Similarity 100.0%; Pred. No. 34; 

Matches 5; Conservative 0; Mismatches 0; Indels 0; Gaps 

Qy 4 LDAEF 8 

Mill 

Db 52 LDAEF 5 6 



RESULT 9 
EFlB_SULSO 

ID EFlB_SULSO STANDARD; PRT; 90 AA. 

AC Q64214; 

DT 15-JUL-1998 (Rel. 36, Created) 

DT 30-MAY-2000 (Rel. 39, Last sequence update) 

DT 28-FEB-2003 (Rel. 41, Last annotation update) 

DE Elongation factor 1-beta (EF-l-beta) (aEF-lbeta) . 

GN EF1B OR SS05345. 

OS Sulfolobus solf ataricus . 

OC Archaea; Crenar chaeota ; Thermoprotei ; Sul f olobales ; Sulf olobaceae; 

OC Sulfolobus. 

OX NCBI_TaxID=2287 ; 

RN [1] 

RP SEQUENCE FROM N.A., AND SEQUENCE OF 1-11 AND 82-90. 

RC STRAIN=ATCC 49255 / DSM 5833 / MT-4; 

RX MEDLINE=9535 92 09; PubMed=7 63273 9 ; 

RA Arcari P., Raimo G . , Ianniciello G. , Gallo M. , Bocchini V. ; 

RT "The first nucleotide sequence of an archaeal elongation factor 1 

RT beta gene . " ; 

RL Biochim. Biophys . Acta 1263:86-8 8(1995). 

RN [2] 

RP SEQUENCE FROM N . A. 

RC STRAIN^ATCC 35092 / DSM 1617 / P2; 

RX MEDLINE=213322 9 6; PubMed-1142772 6 ; 

RA She Q., Singh R.K., Confalonieri F . , Zivanovic Y . , Allard G . , 

RA Awayez M.J., Chan-Weiher C.C.-Y., Clausen I.G., Curtis B.A., 

RA De Moors A., Erauso G . , Fletcher C, Gordon P.M.K., 

RA Heikamp-de Jong I., Jeffries A.C., Kozera C.J., Medina N., Peng X., 

RA Thi-Ngoc H.P., Redder P., Schenk M.E., Theriault C, Tolstrup N., 

RA Charlebois R.L., Doolittle W.F., Duguet M. , Gaasterland T., 

RA Garrett R.A., Ragan M.A. , Sensen C.W., Van der Oost J. ; 

RT "The complete genome of the crenarchaeon Sulfolobus solf ataricus P2 . ' 

RL Proc. Natl. Acad. Sci. U.S.A. 9 8:7835-7840(2001). 

RN [3] 

RP SEQUENCE, FUNCTION, SUBUNITS, AND MASS SPECTROMETRY. 

RC STRAIN=ATCC 49255 / DSM 5833 / MT-4; 

RX MEDLINE=9618 62 82 ; PubMed=8 652 615 ; 

RA Raimo G., Masullo M. , Savino G., Scarano G. , Ianniciello G., 

RA Parente A. , Bocchini V. ; 

RT "Archaeal elongation factor 1 beta is a dimer. Primary structure, 

RT molecular and biochemical properties."; 

RL Biochim. Biophys. Acta 12 93:106-112(1996). 

CC FUNCTION: Promotes the exchange of GDP for GTP in EF-l-alpha/GDP, 

cc thus allowing the regeneration of EF-l-alpha/GTP that could then 

cc b e used to form the ternary complex EF-l-alpha/ GTP/AAtRNA. 

CC -!- SUBUNIT: Homodimer. 

CC -!- MASS SPECTROMETRY: MW=10006; METHOD=Electrospray . 



CC -!- SIMILARITY: Belongs to the EF- 1-beta/EF— 1— delta family. 

CC 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinf ormatics and the EMBL outstation - 

CC the European Bioinf ormatics Institute. There are no restrictions on its 

CC use by non-profit institutions as long as its content is in no way 

CC modified and this statement is not removed. Usage by and for commercial 

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/ 

CC or send an email to license@isb-sib . ch) . 

CC 

DR EMBL; X76768; CAA54164.1; 

DR EMBL; AE006655; AAK40523.1; -. 

DR PIR; S57268; S57268. 

DR HSSP; 027734; 1GH8 . 

DR HAMAP; MF__00043; - ; 1. 

DR InterPro; IPR004542; aEF-l_beta. 

DR InterPro; IPR001326; EF1_BD. 

DR Pfam; PF00736; EF1BD; 1. 

DR TIGRFAMs; TIGR00489; aEF-l_beta; 1. 

KW Elongation factor; Protein biosynthesis; Complete proteome. 

FT INIT_MET 0 0 

SQ SEQUENCE 90 AA; 10005 MW; 2 4CB357FD3259C8A CRC64; 

Query Match 62.5%; Score 5; DB 1; Length 90; 

Best Local Similarity 100.0%; Pred. No. 41; 

Matches 5; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 

Qy 1 EVNLD 5 

I I I I I 

Db 15 EVNLD 19 



RESULT 10 
EF1B_PYRAB 

ID EF1B_PYRAB STANDARD; PRT; 91 AA. 

AC Q9V2P6; 

DT 16-OCT-2001 (Rel. 40, Created) 

DT 16-OCT-2001 (Rel. 40, Last sequence update) 

DT 10-OCT-2003 (Rel. 42, Last annotation update) 

DE Elongation factor 1-beta (EF-l-beta) (aEF-lbeta) . 

GN EF1B OR PYRAB00290 OR PAB3009. 

OS Pyrococcus abyssi. 

OC Archaea; Euryarchaeota ; Thermococci; Thermococcales ; Thermococcaceae ; 

OC Pyrococcus . 

OX NCBI_TaxID=29292 ; 

RN [1] 

RP SEQUENCE FROM N.A. 

RC STRAIN=GE5 / Or say; 

RX MEDLINE=225 11545; PubMed=12 622 8 08 ; 

RA Cohen G.N., Barbe V., Flament D., Galperin M. , Heilig R. , Lecompte O., 

RA Poch O., Prieur D., Querellou J., Ripp R., Thierry J.-C, 

RA Van der Oost J., Weissenbach J., Zivanovic Y. , Forterre P.; 

RT "An integrated analysis of the genome of the hyperthermophilic 

RT archaeon Pyrococcus abyssi."; 

RL Mol. Microbiol. 4 7:1495-1512(2003). 

CC -!- FUNCTION: Promotes the exchange of GDP for GTP in EF-l-alpha/GDP , 
CC thus allowing the regeneration of EF-l-alpha/GTP that could then 



CC be used to form the ternary complex EF- 1- alpha/ GTP/AAtRNA (By 

CC similarity) . 

CC SIMILARITY: Belongs to the EF-l-beta/EF-l-delta family. 

CC 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinf ormatics and the EMBL outstation - 

CC the European Bioinf ormatics Institute. There are no restrictions on its 

CC use by non-profit institutions as long as its content is in no way 

CC modified and this statement is not removed. Usage by and for commercial 

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/ 

CC or send an email to license@isb-sib . ch) . 

CC 

DR EMBL; AJ248283; CAB48952.1; -. 

DR PIR; A75188; A75188. 

DR HSSP; 027734; 1GH8 . 

DR HAMAP; MF_00043; -; 1. 

DR InterPro; IPR004542; aEF-l_beta. 

DR InterPro; IPR001326; EF1_BD. 

DR Pfam; PF00736; EF1BD; 1. 

DR TIGRFAMs; TIGR00489; aEF-l_beta; 1. 

KW Elongation factor; Protein biosynthesis; Complete proteome, 

SQ SEQUENCE 91 AA; 10239 MW; 45A032D81D54F1EA CRC64; 

Query Match 62.5%; Score 5; DB 1 ; Length 91; 

Best Local Similarity 100.0%; Pred. No. 41; 

Matches 5; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 

Qy 1 EVNLD 5 

MM! 

Db 18 EVNLD 22 



RESULT 11 
EF1B PYRFU 



ID 
AC 
DT 
DT 
DT 
DE 
GN 
OS 

oc 
oc 
ox 

RN 
RP 
RC 
RA 
RT 
RL 
CC 
CC 
CC 
CC 
CC 
CC 



41, Created) 

41, Last sequence update) 
41, Last annotation update) 
1-beta (EF-l-beta) (aEF-lbeta) 



Thermococci; Thermococcales ; Thermococcaceae; 



EF1B_PYRFU STANDARD; PRT; 91 AA. 

Q8TZM5; 

28-FEB-2003 (Rel. 
28-FEB-2003 (Rel. 
28-FEB-2003 (Rel. 
Elongation factor 
EF1B OR PF1965. 
Pyrococcus furiosus . 
Archaea; Euryarchaeota ; 
Pyrococcus . 
NCBI_TaxID=22 61; 
[1] 

SEQUENCE FROM N.A. 

STRAIN=Vcl / DSM 3638 / ATCC 43587 / JCM 8422; 
Weiss R.B., Dunn D.M., Robb F.T., Brown J.R.; 
"The complete sequence of the Pyrococcus furiosus genome. 
Submitted (FEB-2002) to the EMBL/ GenBank/DDBJ databases. 

FUNCTION: Promotes the exchange of GDP for GTP in EF-l-alpha/GDP , 
thus allowing the regeneration of EF-l-alpha/GTP that could then 
be used to form the ternary complex EF-l-alpha/ GTP/AAt RNA (By 
similarity) . 

SIMILARITY: Belongs to the EF-l-beta/EF-l-delta family. 



_ t _ 



it 



_ i _ 



CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC betwe en the Swiss Institute of Bioinf ormatics and the EMBL outstation — 

CC the European Bioinf ormatics Institute. There are no restrictions on its 

CC use by non-profit institutions as long as its content is in no way 

CC modified and this statement is not removed. Usage by and for commercial 

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/ 

CC or send an email to license@isb-sib . ch) . 

CC 

DR EMBL; AE010290; AAL82089.1; -. 

DR HAMAP; MF_00043; -; 1. 

DR InterPro; IPR004542; aEF-l_beta. 

DR InterPro; IPR001326; EF1_BD. 

DR Pfam; PF00736; EF1BD; 1. 

DR TIGRFAMs; TIGR00489; aEF-l_beta; 1. 

KW Elongation factor; Protein biosynthesis; Complete proteome. 

SQ SEQUENCE 91 AA; 10322 MW; C54B247AEC77136C CRC64; 

Query Match 62.5%; Score 5; DB 1; Length 91; 

Best Local Similarity 100.0%; Pred. No. 41; 

Matches 5; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 

Qy 1 EVNLD 5 

I I I I I 

Db 18 EVNLD 22 



RESULT 12 
EFlB_SULTO 

ID EFlB_SULTO STANDARD; PRT; 91 AA. 

AC Q976H8; 

DT 28-FEB-2003 (Rel. 41, Created) 

DT 28-FEB-2003 (Rel. 41, Last sequence update) 

DT 10-OCT-2003 (Rel. 42, Last annotation update) 

DE Elongation factor 1-beta (EF-l-beta) (aEF-lbeta) . 

GN EF1B OR ST0208.2 OR STS027. 

OS Sulfolobus tokodaii. 

OC Archaea; Crenarchaeota ; Thermoprotei ; Sulf olobales ; Sulf olobaceae ; 

OC Sulfolobus. 

OX NCBI_TaxID=111955; 

RN [1] 

RP SEQUENCE FROM N.A. 

RC STRAIN=JCM 10545 / 7; 

RX MEDLINE-214 5615 6; PubMed=l 1572 47 9 ; 

RA Kawarabayasi Y. , Hino Y. , Horikawa H., Jin-no K., Takahashi M. , 

RA Sekine M. , Baba S.-I., Ankai A., Kosugi H., Hosoyama A., Fukui S., 

RA Nagai Y . , Nishijima K. , Otsuka R. , Nakazawa H., Takamiya M. , Kato Y., 

RA Yoshizawa T., Tanaka T., Kudoh Y. , Yamazaki J., Kushida N., Oguchi A. 

RA Aoki K.-I., Masuda S., Yanagii M. , Nishimura M. , Yamagishi A., 

RA Oshima T., Kikuchi H.; 

RT "Complete genome sequence of an aerobic thermoacidophilic 

RT Crenarchaeon, Sulfolobus tokodaii strain7. M ; 

RL DNARes. 8:123-140(2001). 

CC -!- FUNCTION: Promotes the exchange of GDP for GTP in EF-l-alpha/GDP, 
CC thus allowing the regeneration of EF-l-alpha/GTP that could then 

CC be used to form the ternary complex EF-1 -alpha/ GTP/ AAtRNA (By 

CC similarity) . 

CC -!- SIMILARITY: Belongs to the EF-l-beta/EF-l-delta family. 



cc 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinf ormatics and the EMBL outstation - 

CC the European Bioinf ormatics Institute. There are no restrictions on its 

CC use by non-profit institutions as long as its content is in no way 

CC modified and this statement is not removed. Usage by and for commercial 

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/ 

CC or send an email to license@isb-sib . ch ) . 

CC 

DR EMBL; AP000981; BAB65169.1; -. 

DR HAMAP ; MF_00043; 1. 

DR InterPro; IPR004542; aEF-l_beta. 

DR InterPro; IPR001326; EFl_BD. 

DR Pfam; PF00736; EF1BD; 1. 

DR TIGRFAMs ; TIGR00489; aEF-l_beta; 1. 

KW Elongation factor; Protein biosynthesis; Complete proteome . 

SQ SEQUENCE 91 AA; 10006 MW; E8 64 9D4 64 6DC34 8B CRC64 ; 

Query Match 62.5%; Score 5; DB 1; Length 91; 

Best Local Similarity 100.0%; Pred. No. 41; 

Matches 5; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 

Qy 1 EVNLD 5 

11:1 

Db 16 EVNLD 20 



RESULT 13 
KM1 1_TRYBB 

ID KM11_TRYBB STANDARD; PRT; 92 AA. 

AC Q26773; 

DT 10-OCT-2003 (Rel. 42, Created) 

DT 10-OCT-2003 (Rel. 42, Last sequence update) 

DT 10-OCT-2003 (Rel. 42, Last annotation update) 

DE Kinetoplas tid membrane protein-11 (KMP-11). 

GN KMP-11/1 AND KMP-11/2 AND KMP-11/3 AND KMP-11/4. 

OS Trypanosoma brucei brucei, and 

OS Trypanosoma brucei rhodesiense. 

OC Eukaryota; Euglenozoa; Kinetoplas tida ; Trypanosomatidae ; Trypanosoma. 

OX NCBI_TaxID=5702, 31286; 

RN [1] 

RP SEQUENCE FROM N.A., AND SEQUENCE OF 32-45. 

RC SPECIES=T.b. rhodesiense; S T RAI N— Vi Tat 1.1; 

RX MEDLINE=97 04 7 3 87; PubMed=8 8 92 3 07 ; 

RA Stebeck C.E., Baron G.S., Beecroft R.P., Pearson T.W.; 

RT "Molecular characterization of the kinetoplas tid membrane protein-11 

RT from African trypanosomes . " ; 

RL Mol. Biochem. Parasitol. 81:81-88(1996). 

RN [2] 

RP SEQUENCE FROM N.A. 

RC SPECIES=T . b . brucei ; 

RX MEDLINE=9 822 617 9; PubMed=956652 7 ; 

RA Bridge M.A., Zhou Q., Koop B.F., Pearson T.W.; 

RT "Cloning and characterization of the kinetoplastid membrane protein-11 

RT gene locus of Trypanosoma brucei."; 

RL Mol. Biochem. Parasitol. 91:359-363(1998). 

RN [3] 



RP SUBCELLULAR LOCATION. 

RX MEDLINE=95356787 ; PubMed=7 63 037 4 ; 

RA Stebeck C.E., Beecroft R.P., Singh B.N., Jardim A., Olafson R.W., 

RA Tuckey C, Prenevost K.D., Pearson T.W.; 

RT "Kinetoplas tid membrane protein-ll (KMP-11) is differentially 

RT expressed during the life cycle of African trypanosomes and is found 

RT in a wide variety of kinetoplas tid parasites."; 

RL Mol. Biochem. Parasitol. 71:1-13(1995). 

CC -!- FUNCTION: May be involved in the regulation of the cytoskeleton 
CC through interaction with the subpellicular microtubules. May be 

CC involved in parasite mobility and attachment to the surface of the 

CC host cell. Behaves as a strong immunogen during infection (By 

CC similarity) . 

CC -!- SUBUNIT: Monomer (By similarity). 

CC -!- SUBCELLULAR LOCATION: Cytoplasmic. Associated with microtubules. 

CC -!- MISCELLANEOUS: There are four copies of the KMP-11 gene in 
CC T.b.brucei while a single copy of the gene is found in 

CC T . b . rhodesiense . 

CC -!- SIMILARITY: Belongs to the KMP-11 family. 

CC 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinf ormatics and the EMBL outstation - 

CC the European Bioinf ormatics Institute. There are no restrictions on its 

CC use by non-profit institutions as long as its content is in no way 

CC modified and this statement is not removed. Usage by and for commercial 

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/ 

CC or send an email to license@isb-sib . ch ) . 

CC 

DR EMBL; X96439; CAA65305.1; -. 

DR EMBL; AF028726; AAC38990.1; -. 

DR EMBL; AF028726; AAC38991.1; -. 

DR EMBL; AF028726; AAC38992.1; -. 

DR EMBL; AF028726; AAC38993.1; -. 

DR GO; GO: 0015630; C : microtubule cytoskeleton; ISS. 

DR GO; GO: 0030260; P:cell invasion; ISS. 

DR GO; GO:0001539; P : ciliary/ flagellar motility ; ISS. 

DR GO; GO: 0007010; P : cytos keleton organization and biogenesis; ISS. 

DR InterPro; IPR004132; KMP11. 

DR Pfam; PF03037; KMP11; 1. 

KW Antigen; Cytoskeleton; Microtubule; Multigene family. 

SQ SEQUENCE 92 AA; 11076 MW; B5 032 8 C8 1A7 7 0E4 C CRC64 ; 



Query Match 62.5%; Score 5; DB 1; Length 92; 

Best Local Similarity 100.0%; Pred. No. 41; 

Matches 5; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 



Qy 4 LDAEF 8 

Mill 

Db 15 LDAEF 19 



RESULT 14 
KM11JTRYCR 

ID KM11_TRYCR STANDARD; PRT; 92 AA. 

AC Q9U6Z1; 021435; Q9U6Z0; 

DT 10-OCT-2003 (Rel. 42, Created) 

DT 10-OCT-2003 (Rel. 42, Last sequence update) 



DT 10-OCT-2003 (Rel. 42, Last annotation update) 
DE Kinetoplas tid membrane protein-11 (KMP-11) . 
GN KMP-11 OR KMP11. 
OS Trypanosoma cruzi. 

OC Eukaryota; Euglenozoa; Kinetoplas tida ; Trypanosomatidae; Trypanosoma. 

OX NCBI_TaxID=5 693; 
RN [1] 

RP SEQUENCE FROM N.A., SUBCELLULAR LOCATION, AND DEVELOPMENTAL STAGE. 

RC STRAIN=Tulahuen, and Y; 

RX MEDLINE=2 0132225; PubMed=10668 7 91 ; 

RA Thomas M.C., Garcia-Perez J.L., Alonso C, Lopez M.C.; 
RT "Molecular characterization of KMP11 from Trypanosoma cruzi: a 

RT cytoskeleton-associated protein regulated at the translational 

RT level."; 

RL DNA Cell Biol. 19:47-57(2000). 

RN [2] 

RP FUNCTION. 

RX MEDLINE=21195155; PubMed=112 98 135 ; 

RA Thomas M.C., Longobardo M.V., Carmelo E . , Maranon C, Planelles L., 

RA Patarroyo M.E., Alonso C, Lopez M.C.; 

RT "Mapping of the antigenic determinants of the T. cruzi kinetoplastid 

RT membrane protein-11. Identification of a linear epitope specifically 

RT recognized by human Chagasic sera."; 

RL Clin. Exp. Immunol. 123:4 65-471(2 001). 

GC -!- FUNCTION: May be involved in the regulation of the cytoskeleton 
CC through interaction with the subpellicular microtubules. May be 

GC involved in parasite mobility and attachment to the surface of the 

GC host cell. Behaves as a strong immunogen during infection. 

CC -!- SUBUNIT: Monomer (By similarity). 

CC -!- SUBCELLULAR LOCATION: Cytoplasmic. Associated with microtubules. 

CC -!- DEVELOPMENTAL STAGE: Abundantly expressed in amastigotes and 
GC trypomastigotes . In epimas tigotes , KMP-11 is expressed at a high 

CC level during the logarithmic growth phase but is downregulated 

CC during the stationary phase of growth. 

CC -!- MISCELLANEOUS: There are four copies of the KMP-11 gene in 
CC T. cruzi. 

CC -!- SIMILARITY: Belongs to the KMP-11 family. 

CC 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinf ormatics and the EMBL outstation - 

CC the European Bioinf ormatics Institute. There are no restrictions on its 

CC use by non-profit institutions as long as its content is in no way 

CC modified and this statement is not removed. Usage by and for commercial 

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/ 

CC or send an email to license@isb-sib . ch) . 

CC 

DR EMBL; AJ000077; CAA03901.1; -. 

DR EMBL; AF167435; AAF04809.1; -. 

DR EMBL; AF167436; AAF04810.1; -. 

DR GO; GO: 0015630; C : microtubule cytoskeleton; IDA. 

DR GO; GO: 0030260; P:cell invasion; NAS . 

DR GO; GO:0001539; P : ciliary/ flagellar motility ; NAS. 

DR GO; GO: 0007010; P : cytoskeleton organization and biogenesis; IDA. 

DR InterPro; IPR004132; KMP11. 

DR Pfam; PF03037; KMP11; 1. 

KW Antigen; Cytoskeleton; Microtubule; Multigene family. 

FT CONFLICT 92 92 MISSING (IN REF. 1; AAF04809) . 



SQ SEQUENCE 92 AA; 11011 MW; 34 1CDD0E2 1D3EFB1 CRC64; 



Query Match 62.5%; Score 5; DB 1; Length 92; 

Best Local Similarity 100.0%; Pred. No. 41; 

Matches 5; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 

Qy 4 LDAEF 8 

[Mil 

Db 15 LDAEF 19 



RESULT 15 
AC PS BACHD 



ID ACPS_BACHD STANDARD; PRT; 119 AA. 

AC Q9KFG1; 

DT 28-FEB-2003 (Rel. 41, Created) 

DT 28-FEB-2003 (Rel. 41, Last sequence update) 

DT 28-FEB-2003 (Rel. 41, Last annotation update) 

DE Holo- [ acyl-carrier protein] synthase (EC 2.7.8.7) (Holo-ACP synthase) 

DE ( 4 ' -phosphopantetheinyl transferase acpS) . 

GN ACPS OR BH0518. 

OS Bacillus halodurans . 

OC Bacteria; Firmicutes; Bacillales; Bacillaceae; Bacillus. 

OX NCBI_TaxID=8 6665; 

RN [1] 

RP SEQUENCE FROM N.A. 

RC STRAIN=C-125 / JCM 9153; 

RX MEDLINE=2 0512 5 82; PubMed=11058132 ; 

RA Takami H., Nakasone K., Takaki Y., Maeno G., Sasaki R. , Masui N., 

RA Fuji F. , Hirama C, Nakamura Y. , Ogasawara N., Kuhara S., 

RA Horikoshi K. ; 

RT "Complete genome sequence of the alkaliphilic bacterium Bacillus 

RT halodurans and genomic sequence comparison with Bacillus subtilis. "; 

RL Nucleic Acids Res. 28:4317-4331(2000). 

CC -!- FUNCTION: Transfers the 4 1 -phosphopantetheine moiety from coenzyme 
CC A to a Ser of acyl-carrier protein (By similarity) . 

CC -!- CATALYTIC ACTIVITY: CoA + apo- [ acyl-carrier protein] = adenosine 
CC 3 1 , 5 1 -bisphosphate + holo- [ acyl-carrier protein] . 

CC -!- COFACTOR: Magnesium (By similarity). 

CC -!- SUBCELLULAR LOCATION: Cytoplasmic (By similarity). 

CC SIMILARITY: Belongs to the P-Pant transferase superfamily. AcpS 

CC family. 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinf ormatics and the EMBL outstation - 

CC the European Bioinf ormatics Institute. There are no restrictions on its 

CC use by non-profit institutions as long as its content is in no way 

CC modified and this statement is not removed. Usage by and for commercial 

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/ 

CC or send an email to license@isb-sib . ch) . 

CC 

DR EMBL; AP001508; BAB04237.1; -. 

DR PIR; F83714; F83714. 

DR HAMAP; MF_00101; 1. 

DR InterPro; IPR008278; 4-PPT_trans f . 

DR InterPro; IPR002582; ACPS. 

DR InterPro; IPR004568; Pantethn trn. 



DR Pfam; PF01648; ACPS; 1. 

DR ProDom; PD004282; ACPS; 1. 

DR TIGRFAMs; TIGR00516; acpS; 1. 

DR TIGRFAMs; TIGR00556; pantethn_t rn ; 1. 

KW Transferase; Lipid synthesis; Fatty acid biosynthesis; Magnesium; 

KW Complete proteome. 

FT METAL 8 8 MAGNESIUM (BY SIMILARITY) . 

FT METAL 58 58 MAGNESIUM (BY SIMILARITY) . 

SQ SEQUENCE 119 AA; 13421 MW; 227 9E55254 904 1C9 CRC64 ; 

Query Match 62.5%; Score 5; DB 1; Length 119; 

Best Local Similarity 100.0%; Pred. No. 51; 

Matches 5; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 

Qy 2 VNLDA 6 

I I I I I 

Db 91 VNLDA 95 



RESULT 16 
PFD4_CAEEL 

ID PFD4_CAEEL STANDARD; PRT; 126 AA. 

AC Q17435; 

DT 16-OCT-2001 (Rel. 40, Created) 

DT 16-OCT-2001 (Rel. 40, Last sequence update) 

DT 28-FEB-2003 (Rel. 41, Last annotation update) 

DE Probable pref oldin subunit 4 . 

GN B0035.4. 

OS Caenorhabditis elegans . 

OC Eukaryota; Metazoa; Nematoda; Chromadorea; Rhabditida; Rhabdi toidea ; 

OC Rhabditidae; Peloderinae; Caenorhabditis. 

OX NCBI_TaxID=6239; 

RN [1] 

RP SEQUENCE FROM N.A. 

RC STRAIN=Bris tol N2 ; 

RA White S . ; 

RL Submitted (MAY-1996) to the EMBL/ GenBank/DDBJ databases. 

RN [2] 

RP SEQUENCE FROM N.A. 

RC STRAIN=Bris tol N2 ; 

RA Kohara Y., Shin 1 i T . , Suzuki Y. , Sugano S., Potdevin M. , 

RA Thierry-Mieg Y., Thierry-Mieg D., Thierry-Mieg J.; 

RL Submitted (AUG-2000) to the EMBL/ GenBank/DDBJ databases. 

CC -!- FUNCTION: Binds specifically to cytosolic chaperonin (c-CPN) and 

CC transfers target proteins to it. Binds to nascent polypeptide 

CC chain and promotes folding in an environment in which there are 

CC many competing pathways for nonnative proteins (By similarity) . 

CC SUBUNIT: Heterohexamer of two PFD-alpha type and four PFD-beta 

CC typ e subunits (By similarity) . 

CC -!- SIMILARITY: Belongs to the pref oldin beta subunit family. 

CC 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinf ormatics and the EMBL outstation - 

CC the European Bioinf ormatics Institute. There are no restrictions on its 

CC use by non-profit institutions as long as its content is in no way 

CC modified and this statement is not removed. Usage by and for commercial 

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/ 



CC or send an email to license@isb-sib . ch) . 

CC 

DR EMBL; Z73102; CAA97410.1; -. 

DR EMBL; AF292051; AAG41147.1; -. 

DR PIR; T18655; T18655. 

DR WormPep; B0035.4; CE05162 . 

KW Chaperone. 

SQ SEQUENCE 126 AA; 14031 MW; 22D8B5B37 0AB4DE8 CRC64; 

Query Match 62.5%; Score 5; DB 1; Length 126; 

Best Local Similarity 100.0%; Pred. No. 53; 

Matches 5; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 

Qy 3 NLDAE 7 

I I I I I 

Db 121 NLDAE 125 



RESULT 17 
GREA CHLTE 



ID GREA_CHLTE STANDARD; PRT; 159 AA. 

AC Q8KCH5; 

DT 28-FEB-2003 (Rel. 41, Created) 

DT 28-FEB-2003 (Rel. 41, Last sequence update) 

DT 28-FEB-2003 (Rel. 41, Last annotation update) 

DE Transcription elongation factor greA (Transcript cleavage factor 

DE greA) . 

GN GREA OR CT144 6. 

OS Chlorobium tepidum. 

OC Bacteria; Chlorobi; Chlorobia; Chlorobiales ; Chlorobiaceae ; 

OC Chlorobium. 

OX NCBI_TaxID=1097 ; 

RN [1] 

RP SEQUENCE FROM N.A. 

RC STRAIN=TLS / ATCC 49652 / DSM 12025; 

RX MEDLINE=2 2103 68 5; PubMed= 12093901; 

RA Eisen J. A., Nelson K.E., Paulsen I.T., Heidelberg J.F., Wu M. , 

RA Dodson R.J., Deboy R. , Gwinn M.L., Nelson W.C., Haft D.H., 

RA Hickey E.K., Peterson J.D., Durkin A.S., Kolonay J.L., Yang F. , 

RA Holt I., Umayam L.A., Mason T . , Brenner M. , Shea T.P., Parksey D . , 

RA Nierman W.C., Feldblyum T.V., Hansen C.L., Craven M.B., Radune D., 

RA Vamathevan J., Khouri H., White 0., Gruber T.M., Ketchum K.A. , 

RA Venter J.C., Tettelin H., Bryant D.A. , Fraser CM. ; 

RT "The complete genome sequence of Chlorobium tepidum TLS, a 

RT photosynthetic, anaerobic, green-sulfur bacterium, 

RL Proc. Natl. Acad. Sci. U.S.A. 99:9509-9514(2002). 

CC ~!~ FUNCTION: Necessary for efficient RNA polymerase transcription 
CC elongation past template-encoded arresting sites. The arresting 

CC sites in DNA have the property of trapping a certain fraction of 

CC elongating RNA polymerases that pass through, resulting in locked 

CC ternary complexes. Cleavage of the nascent transcript by cleavage 

CC factors such as greA or greB allows the resumption of elongation 

CC from the new 3 1 terminus . GreA releases sequences of 2 to 3 

CC nucleotides (By similarity) . 

CC -!- SIMILARITY: Belongs to the greA/greB family. 

CC 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 



CC between the Swiss Institute of Bioinf ormatics and the EMBL outstation - 

CC the European Bioinf ormatics Institute. There are no restrictions on its 

CC use by non-profit institutions as long as its content is in no way 

CC modified and this statement is not removed. Usage by and for commercial 

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/ 

CC or send an email to license@isb-sib . ch ) . 

CC 

DR EMBL; AE012902; AAM72 674.1; -. 

DR TIGR; CT1446; 

DR HAMAP; MF__00105; -; 1. 

DR InterPro; IPR006359; GreA. 

DR InterPro; IPR001437; GreA_GreB. 

DR Pfam; PF01272; GreA_GreB; 1. 

DR Pfam; PF03449; GreA_GreB_N; 1. 

DR ProDom; PD004918; GreA_GreB; 1. 

DR TIGRFAMs; TIGR01462; greA; 1. 

DR PROSITE; PS00829; GREAB_1 ; 1. 

DR PROSITE; PS00830; GREAB_2 ; 1. 

KW Transcription regulation; DNA-binding; Coiled coil; Complete proteome. 

FT DOMAIN 43 76 COILED COIL (POTENTIAL) . 

SQ SEQUENCE 159 AA; 17949 MW; 2B59B2DAE2409AB0 CRC64; 

Query Match 62.5%; Score 5; DB 1; Length 159; 

Best Local Similarity 100.0%; Pred. No. 65; 

Matches 5; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 

Qy 3 NLDAE 7 

I I I I I 

Db 9 6 NLDAE 10 0 



RESULT 18 






YFIR_ 


_ECOLI 






ID 


YFIR ECOLI STANDARD; PRT; 172 AA. 






AC 


P76597; 






DT 


15-DEC-1998 (Rel. 37, Created) 






DT 


15-DEC-1998 (Rel. 37, Last sequence update) 






DT 


16-OCT-2001 (Rel. 40, Last annotation update) 






DE 


Hypothetical protein yfiR precursor. 






GN 


YFIR OR B2603 OR Z3897 OR ECS3466. 






OS 


Escherichia coli, and 






OS 


Escherichia coli 0157 :H7. 






OC 


Bacteria; Proteobacteria ; Gammaproteobacteria ; Enterobacteriales ; 


OC 


Enterobacteriaceae; Escherichia . 






OX 


NCBI TaxID=562, 83334; 






RN 


[1] 






RP 


SEQUENCE FROM N.A. 






RC 


STRAIN=K12 / MG1655; 






RX 


MEDLINE=97 42 6617; PubMed=92 7 8 503 ; 






RA 


Blattner F.R., Plunkett G. Ill, Bloch C.A., Perna 


N. 


T. , Burland V 


RA 


Riley M. , Collado-Vides J., Glasner J.D., Rode C. 


K. , 


Mayhew G. F. , 


RA 


Gregor J., Davis N.W., Kirkpatrick H.A. , Goeden M 


.A. 


, Rose D . J . , 


RA 


Mau B. , Shao Y. ; 






RT 


"The complete genome sequence of Escherichia coli 


K- 


12 . "; 


RL 


Science 277:1453-1474(1997). 






RN 


[2] 






RP 


SEQUENCE FROM N.A. 







RC STRAIN=0157 :H7 / EDL933 / ATCC 700927; 

RX MEDLINE=2 1074 935; PubMed=112 0 655 1 ; 

RA Perna N.T., Plunkett G. Ill, Burland V., Man B., Glasner J.D., 

RA Rose D.J., Mayhew G.F., Evans P.S., Gregor J., Kirkpatrick H.A., 

RA Posfai G., Hackett J., Klink S., Boutin A., Shao Y . , Miller L . , 

RA Grotbeck E.J., Davis N.W., Lim A., Dimalanta E.T., Potamousis K., 

RA Apodaca J., Anantharaman T.S., Lin J., Yen G. , Schwartz D.C., 

RA Welch R.A., Blattner F.R.; 

RT "Genome sequence of enterohaemorrhagic Escherichia coli 0157 :H7."; 

RL Nature 4 09:529-533(2001). 

RN [3] 

RP SEQUENCE FROM N . A. 

RC STRAIN=0157 :H7 / RIMD 0509952; 

RX MEDLINE=2 1156231; PubMed=l 1258 7 96 ; 

RA Hayashi T . , Makino K. , Ohnishi M. , Kurokawa K., Ishii K., Yokoyama K. , 

RA Han C.-G., Ohtsubo E . , Nakayama K., Murata T., Tanaka M. , Tobe T., 

RA Iida T., Takami H., Honda T . , Sasakawa C. f Ogasawara N., Yasunaga T., 

RA Kuhara S., Shiba T . , Hattori M. , Shinagawa H.; 

RT "Complete genome sequence of enterohemorrhagic Escherichia coli 

RT 0157:H7 and genomic comparison with a laboratory strain K-12." ; 

RL DNARes. 8:11-22(2001). 

cc 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinf ormatics and the EMBL outstation - 

CC the European Bioinf ormatics Institute. There are no restrictions on its 

CC use by non-profit institutions as long as its content is in no way 

CC modified and this statement is not removed. Usage by and for commercial 

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/ 

CC or send an email to license@isb-sib . ch) . 

CC 

DR EMBL; AE000346; AAC75652.1; -. 

DR EMBL; AE005490; AAG57714.1; -. 

DR EMBL; AP002562; BAB36889.1; -. 

DR PIR; B91062; B91062. 

DR PIR; F65038; F65038. 

DR PIR; F85906; F85906. 

DR EcoGene; EG14225; yfiR. 

KW Hypothetical protein; Signal; Complete proteome. 

FT SIGNAL 1 22 POTENTIAL. 

FT CHAIN 2 3 172 HYPOTHETICAL PROTEIN YFIR. 

SQ SEQUENCE 172 AA; 18978 MW; 2E2CC8 12E4 9F3AD5 CRC64; 

Query Match 62.5%; Score 5; DB 1; Length 172; 

Best Local Similarity 100.0%; Pred. No. 69; 

Matches 5; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 

Qy 2 VNLDA 6 

EMM 

Db 14 6 VNLDA 15 0 



RESULT 19 
KAD 1_ANAS P 

ID KAD 1_ANAS P STANDARD; PRT; 184 AA. 

AC Q8YPJ8; 

DT 28-FEB-2003 (Rel. 41, Created) 

DT 28-FEB-2003 (Rel. 41, Last sequence update) 



DT 28-FEB-2003 (Rel. 41, Last annotation update) 

DE Adenylate kinase 1 (EC 2.7.4.3) (ATP -AMP transphosphorylase 1). 

GN ADK1 OR ALL419 6. 

OS Anabaena sp . (strain PCC 7120) . 

OC Bacteria; Cyanobacteria; Nostocales; Nostocaceae; Nostoc. 

OX NCBI_TaxID=103 690; 
RN [1] 

RP SEQUENCE FROM N.A. 

RX MEDLINE=215 952 85; PubMed=117 5984 0; 

RA Kaneko T., Nakamura Y., Wolk CP., Kuritz T . , Sasamoto S., 

RA Watanabe A., Iriguchi M. , Ishikawa A., Kawashima K., Kimura T., 

RA Kishida Y. , Kohara M. , Matsumoto M. , Matsuno A., Muraki A. , 

RA Nakazaki N., Shimpo S., Sugimoto M. , Takazawa M. , Yamada M. , 

RA Yasuda M. , Tabata S.; 

RT "Complete genomic sequence of the filamentous nitrogen-fixing 

RT cyanobacterium Anabaena sp. strain PCC 7120."; 

RL DNARes. 8:205-213(2001). 

CC -!- FUNCTION: This small ubiquitous enzyme is essential for 
CC maintenance and cell growth. 

CC -!- CATALYTIC ACTIVITY: ATP + AMP = ADP + ADP . 

CC -!- SUBUNIT: Monomer (By similarity). 

CC -!- SUBCELLULAR LOCATION: Cytoplasmic (By similarity). 

CC -!- SIMILARITY: Belongs to the adenylate kinase family. 

CC — 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinf ormatics and the EMBL outstation - 

CC the European Bioinf ormatics Institute. There are no restrictions on its 

CC use by non-profit institutions as long as its content is in no way 

CC modified and this statement is not removed. Usage by and for commercial 

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/ 

CC or send an email to license@isb-sib . ch) . 

CC 

DR EMBL; AP003595; BAB75895.1; -. 

DR PIR; AE2330; AE2330. 

DR HAMAP; MF_00235; -; 1. 

DR InterPro; IPR006259; Adenyl_kin_sub . 

DR InterPro; IPR0O0850; Adenylate_kin . 

DR Pfam; PF00406; ADK; 1. 

DR PRINTS; PR00094; ADENYLTKNASE . 

DR ProDom; PD000657; Adenylate_kin ; 1. 

DR TIGRFAMs; TIGR01351; adk; 1. 

DR PROSITE; PS00113; ADENYLATE_KINASE; 1. 

KW Transferase; Kinase; ATP-binding; Complete proteome. 

FT NP_BIND 8 16 ATP (BY SIMILARITY) . 

SQ SEQUENCE 184 AA; 20442 MW; A0CEDB117 0D93B69 CRC64 ; 



Query Match 62.5%; Score 5; DB 1; Length 184; 

Best Local Similarity 100.0%; Pred. No. 73; 

Matches 5; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 

Qy 2 VNLDA 6 

Mill 

Db 112 VNLDA 116 



RESULT 2 0 
TBP METKA 



ID TBP_METKA STANDARD; PRT ; 18 5 AA. 

AC Q8TX3 8; 

DT 28-FEB-2003 (Rel. 41, Created) 

DT 28-FEB-2003 (Rel. 41, Last sequence update) 

DT 28-FEB-2003 (Rel. 41, Last annotation update) 

DE TATA-box binding protein (TATA-box factor) (TATA sequence-binding 

DE protein) (TBP) (Box A binding protein) (BAP) . 

GN TBP OR MK0839. 

OS Methanopyrus kandleri . 

OC Archaea; Euryar chaeota ; Methanopyri; Methanopyrales ; Methanopyraceae; 

OC Methanopyrus . 

OX NCBI_TaxID=2320; 

RN [1] 

RP SEQUENCE FROM N.A. 

RC STRAIN=AV19 / DSM 6324 / JCM 9639; 

RX MEDLINE=2 1927 64 7 ; PubMed=l 19300 14 ; 

RA Slesarev A. I . , Mezhevaya K.V., Makarova K.S., Polushin N.N., 

RA Shcherbinina O.V., Shakhova V.V. , Belova G.I., Aravind L., 

RA Natale D.A., Rogozin I.B., Tatusov R.L., Wolf Y.I., Stetter K.O., 

RA Malykh A.G., Koonin E.V., Kozyavkin S.A.; 

RT "The complete genome of hyperthermophile Methanopyrus kandleri AVI 9 

RT and monophyly of archaeal methanogens . "; 

RL Proc. Natl. Acad. Sci. U.S.A. 99:4644-4649(2002). 

CC -!- FUNCTION: General factor that plays a role in the activation of 
CC archaeal genes transcribed by RNA polymerase. Binds specifically 

CC to the TATA box promoter element which lies close to the position 

CC of transcription initiation (By similarity) . 

CC -!- SIMILARITY: Belongs to the TBP family. 

CC 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinf ormatics and the EMBL outstation - 

CC the European Bioinf ormatics Institute. There are no restrictions on its 

CC use by non-profit institutions as long as its content is in no way 

CC modified and this statement is not removed. Usage by and for commercial 

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/ 

CC or send an email to license@isb-sib . ch) . 

CC 

DR EMBL; AE010374; AAM02052.1; -. 

DR HAMAP; MF_00408; -; 1. 

DR InterPro; IPR000814; TFIID. 

DR Pfam; PF00352; TBP; 2. 

DR PRINTS; PR00 68 6; TIFACTORIID. 

DR PROSITE; PS00351; TFIID; FALSE_NEG. 

KW Transcription regulation; DNA-binding; Repeat; Complete proteome. 

FT REPEAT 3 7 8 1 . 

FT REPEAT 94 176 2 . 

SQ SEQUENCE 185 AA; 20730 MW; 986CDA328B72E1B3 CRC64; 



Query Match 62.5%; Score 5; DB 1; Length 185; 

Best Local Similarity 100.0%; Pred. No. 73; 

Matches 5; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 

Qy 1 EVNLD 5 

I I I I I 

Db 15 EVNLD 19 



RESULT 21 
0M2 6_HAEIN 

ID OM2 6_HAEIN STANDARD; PRT ; 197 AA. 

AC Q57483; Q9S690; Q9S691; Q9S692; Q9S699; 

DT 01-NOV-1997 (Rel. 35, Created) 

DT 01-NOV-1997 (Rel. 35, Last sequence update) 

DT 28-FEB-2003 (Rel. 41, Last annotation update) 

DE Outer membrane protein 26 precursor. 

GN OMP26 OR SKP OR HI0916. 

OS Haemophilus influenzae. 

OC Bacteria; Proteobacteria ; Gammaproteobacteria ; Pas teurellales ; 

OC Pas teurellaceae ; Haemophilus. 

OX NCBI_TaxID=727 ; 

RN [1] 

RP SEQUENCE FROM N.A. 

RC STRAIN=Rd / KW20 / ATCC 51907; 

RX MEDLINE=9535 0 630; PubMed=7542 8 00 ; 

RA Fleischmann R.D., Adams M.D., White O., Clayton R.A., Kirkness E.F., 

RA Kerlavage A.R., Bult C.J., Tomb J.-F., Dougherty B.A., Merrick J.M., 

RA McKenney K., Sutton G. , Fitzhugh W., Fields C.A., Gocayne J.D., 

RA Scott J.D., Shirley R. , Liu L.-I., Glodek A., Kelley J.M. , 

RA Weidman J.F., Phillips C.A., Spriggs T., Hedblom E., Cotton M.D., 

RA Utterback T.R., Hanna M.C., Nguyen D.T., Saudek D.M., Brandon R.C., 

RA Fine L.D., Fritchman J.L., Fuhrmann J.L., Geoghagen N.S.M., 

RA Gnehm C.L., McDonald L.A., Small K.V., Fraser CM., Smith H.O., 

RA Venter J. C . ; 

RT "Whole-genome random sequencing and assembly of Haemophilus influenzae 

RT Rd . " ; 

RL Science 269:496-512(1995). 

RN [2] 

RP SEQUENCE FROM N.A., AND CHARACTERIZATION. 

RC STRAIN=NTHI 289, UCl, UC2 , and UC10; 

RX MEDLINE=9 918 5 02 3; PubMed=l 0 0 8 5 0 3 9 ; 

RA El-Adhami W., Kyd J.M., Bastin D.A., Cripps A.W.; 

RT "Characterization of the gene encoding a 2 6-kilodal ton protein (OMP26) 

RT from nontypeable Haemophilus influenzae and immune responses to the 

RT recombinant protein."; 

RL Infect. Immun . 67:1935-1942(1999). 

RN [3] 

RP SEQUENCE OF 24-30. 

RX MEDLINE=201374 8 8 ; PubMed=10 67 502 3 ; 

RA Langen H., Takacs B., Evers S., Berndt P., LahmH.W., Wipf B., 

RA Gray C, Fountoulakis M. ; 

RT "Two-dimensional map of the proteome of Haemophilus influenzae."; 

RL Electrophoresis 21:411-429(2000). 

CC -!- SUBCELLULAR LOCATION: Outer membrane. 

CC -!- SIMILARITY: BELONGS TO THE OMPH/HLPA FAMILY. 

CC 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinf ormatics and the EMBL outstation - 

CC the European Bioinf ormatics Institute. There are no restrictions on its 

CC use by non-profit institutions as long as its content is in no way 

CC modified and this statement is not removed. Usage by and for commercial 

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/ 

CC or send an email to license@isb-sib . ch ) . 

CC 

DR EMBL; U32773; AAC22574.1; -. 



DR 
DR 
DR 
DR 
DR 
DR 
DR 
DR 
KW 
FT 
FT 
FT 
FT 
FT 
FT 
FT 
SQ 



EMBL; AF109085; AAD23967.1 
EMBL; AF109086; AAD23968.1 
EMBL; AF109087; AAD23969.1 
EMBL; AF109094; AAD23976.1 
PIR; E64102; E64102. 
TIGR; HI0916; 

InterPro; IPR005632; OmpH. 
Pfam; PF03938; OmpH; 1. 

Outer membrane; Signal; Complete proteome . 



SIGNAL 

CHAIN 

VARIANT 

VARIANT 

VARIANT 

VARIANT 



1 
24 
102 
146 
157 
170 



23 
197 
102 
146 
157 
170 



OUTER MEMBRANE PROTEIN 26. 



(IN STRAINS NTHI 
(IN STRAIN UC1) . 
(IN STRAINS NTHI 
(IN STRAINS NTHI 



SEQUENCE 197 AA; 21718 MW; 



Q -> E 
D -> N 
K -> R 
I -> V 
UC2) . 

0C53927521EDB5DC CRC64 ; 



289 AND UC1) 



289 AND UC10) 
289, UC1 AND 



Query Match 62.5%; Score 5; DB 1; 

Best Local Similarity 100.0%; Pred. No. 77; 
Matches 5; Conservative 0; Mismatches 



Length 197; 



0; Indels 



0; Gaps 



0; 



Qy 

Db 



4 LDAEF 8 

MM! 

4 9 LDAEF 53 



RESULT 2 2 
YDB 6_YEAS T 

ID YDB6_YEAST STANDARD; PRT; 197 AA. 

AC Q12055; 

DT 30-MAY-2000 (Rel. 39, Created) 

DT 30-MAY-2000 (Rel. 39, Last sequence update) 

DT 16-OCT-2001 (Rel. 40, Last annotation update) 

DE Hypothetical 22.7 kDa protein in CDC36-NRP1 intergenic region. 

GN YDL166C. 

OS Saccharomyces cerevisiae (Baker's yeast) . 

OC Eukaryota; Fungi; Ascomycota; Saccharomycotina; Saccharomycetes; 

OC Saccharomycetales ; Saccharomycetaceae ; Saccharomyces. 

OX NCBI_TaxID=4932; 

RN [1] 

RP SEQUENCE FROM N.A. 

RC STRAIN=S288c; 

RA Pohl T.M. ; 

RL Submitted (NOV-1995) to the EMBL/ GenBank/DDBJ databases. 

CC -!- SIMILARITY: Belongs to the UPF0101 family. 

CC 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinf ormatics and the EMBL outstation - 

CC the European Bioinf ormatics Institute. There are no restrictions on its 

CC use by non-profit institutions as long as its content is in no way 

CC modified and this statement is not removed. Usage by and for commercial 

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/ 

CC or send an email to license@isb-sib . ch) . 

CC 

DR EMBL; Z67750; CAA91580.1; 

DR EMBL; Z74214; CAA98740.1; 



DR PIR; S61047; S61047. 

DR GermOnline; 140409; -. 

DR SGD; S0002325; YDL166C. 

DR GO; GO: 0005634; C:nucleus; IDA. 

DR GO; GO: 0006979; P: response to oxidative stress; IMP. 

KW Hypothetical protein; ATP-binding . 

FT NP_BIND 14 21 ATP (POTENTIAL) . 

SQ SEQUENCE 197 AA; 22723 MW; 83 6A2 5B0D7E2 633C CRC64; 

Query Match 62.5%; Score 5; DB 1; Length 197; 

Best Local Similarity 100.0%; Pred. No. 77; 

Matches 5; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 

Qy 3 NLDAE 7 

I I I I I 

Db 124 NLDAE 128 



RESULT 23 
GDIR_YEAST 

ID GDIR_YEAST STANDARD; PRT; 2 02 AA. 

AC Q12434; 

DT 01-NOV-1997 (Rel. 35, Created) 

DT 01-NOV-1997 (Rel. 35, Last sequence update) 

DT 28-FEB-2003 (Rel. 41, Last annotation update) 

DE Rho GDP-dissociation inhibitor ( Rho GDI). 

GN RDI1 OR YDL135C OR D2175. 

OS Saccharomyces cerevisiae (Baker's yeast). 

OC Eukaryota; Fungi; Ascomycota; Saccharomycotina; Saccharomycetes; 

OC Saccharomycetales ; Saccharomycetaceae; Saccharomyces. 

OX NCBI_TaxID=4 932 ; 

RN [1] 

RP SEQUENCE FROM N.A. 

RX MEDLINE=94 32 7 511; PubMed=8 05105 0 ; 

RA Masuda T., Tanaka K., Nonaka H., Yamochi W., Maeda A., Takai Y. ; 

RT "Molecular cloning and characterization of yeast rho GDP dissociation 

RT inhibitor . " ; 

RL J. Biol. Chem. 2 69:19713-19718(1994). 

RN [2] 

RP SEQUENCE FROM N.A. 

RC STRAIN=S288c / FY1679; 

RX MEDLINE=97127 82 6; PubMed=8 972577 ; 

RA Woelfl S., Haneman V., Saluz H.P.; 

RT "Analysis of a 26,756 bp segment from the left arm of yeast 

RT chromosome IV."; 

RL Yeast 12:154 9-1554(1996). 

CC -!- FUNCTION: Regulates the GDP/GTP exchange reaction of the Rho 

CC proteins by inhibiting the dissociation of GDP from them, and the 

CC subsequent binding of GTP to them. 

CC -!- SUBCELLULAR LOCATION: Cytoplasmic. 

CC -!- SIMILARITY: Belongs to the Rho GDI family. 

CC 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinf ormatics and the EMBL outstation - 

CC the European Bioinf ormatics Institute. There are no restrictions on its 

CC use by non-profit institutions as long as its content is in no way 

CC modified and this statement is not removed. Usage by and for commercial 



CC entities requires a license agreement (See http://www.isb-sib.ch/announce/ 

CC or send an email to license@isb-sib.ch) . 

CC 

DR EMBL; D31630; BAA06499.1; 

DR EMBL; X96876; CAA65624.1; 

DR EMBL; Z74183; CAA98708.1; 

DR PIR; A53646; A53646. 

DR HSSP; P52566; 1DS6. 

DR GermOnline; 140378; 

DR SGD; S0002294; RDI1. 

DR GO; GO: 0005829; Crcytosol; IDA. 

DR GO; GO: 0005094; F : Rho GDP-dissociation inhibitor activity; IDA. 

DR GO; GO: 0004871; F: signal transducer activity; IPI. 

DR GO; GO: 0007015; P:actin filament organization; IDA. 

DR GO; GO: 0007264; P: small GTPase mediated signal transduction; IPI. 

DR InterPro; IPR007110; Ig-like. 

DR InterPro; IPR000406; Rho_GDI . 

DR Pfam; PF02115; Rho_GDI ; 1. 

DR PRINTS; PR00492; RHOGDI . 

KW GTPase activation. 

SQ SEQUENCE 202 AA; 23138 MW; 300FF97438 3B496F CRC64; 

Query Match 62.5%; Score 5; DB 1; Length 202; 

Best Local Similarity 100.0%; Pred. No. 79; 

Matches 5; Conse rvative 0; Mismatches 0; Indels 0; Gaps 0; 

Qy 3 NLDAE 7 

Mill 

Db 33 NLDAE 37 



RESULT 2 4 
RGS 4_HUMAN 

ID RGS 4_HUMAN STANDARD; PRT; 2 05 AA. 

AC P49798; 

DT 01-OCT-1996 (Rel. 34, Created) 

DT 01-OCT-1996 {Rel. 34, Last sequence update) 

DT 15-MAR-2004 {Rel. 43, Last annotation update) 

DE Regulator of G-protein signaling 4 ( RGS 4 ) (RGP4) . 

GN RGS 4. 

OS Homo sapiens (Human) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleos tomi ; 

OC Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. 

OX NCBI_TaxID=9 606; 

RN [1] 

RP SEQUENCE FROM N . A. 

RX MEDLINE=9617 84 95; PubMed=8 6022 23 ; 

RA Druey K.M., Blumer K.J., Kang V.H., Kehrl J.H.; 

RT "Inhibition of G-protein-mediated MAP kinase activation by a new 

RT mammalian gene family."; 

RL Nature 379:742-746(1996). 

RN [2] 

RP SEQUENCE FROM N . A. 

RC TISSUE=Brain, and Uterus; 

RA Puhl H.L. Ill, Ikeda S.R., Aronstam R.S.; 

RT "cDNA clones of human proteins involved in signal transduction 

RT seguenced by the Guthrie cDNA resource center (www.cdna.org) ."; 



RL Submitted (MAR-2 002) to the EMBL/ GenBank/DDBJ databases. 
RN [3] 

RP SEQUENCE FROM N . A. 
RC TISSUE=Brain; 

RX MEDLINE-22 38 82 57; PubMed-124 77 932 ; 

RA Strausberg R.L., Feingold E.A. , Grouse L.H., Derge J.G., 

RA Klausner R.D., Collins F.S., Wagner L., Shenmen CM., Schuler G.D., 

RA Altschul S.F., Zeeberg B., Buetow K.H., Schaefer C.F., Bhat N.K., 

RA Hopkins R.F., Jordan H., Moore T., Max S.I., Wang J., Hsieh F. , 

RA Diatchenko L., Marusina K. , Farmer A.A. , Rubin G.M. , Hong L . , 

RA Stapleton M. , Soares M.B., Bonaldo M.F., Casavant T.L., Scheetz T.E., 

RA Brownstein M.J., Usdin T.B., Toshiyuki S., Carninci P., Prange C* 

RA Raha S.S., Loquellano N.A. , Peters G.J., Abramson R.D., Mullahy S.J., 

RA Bosak S.A., McEwan P.J., McKernan K.J., Malek J. A. , Gunaratne P.H., 

RA Richards S., Worley K.C, Hale S., Garcia A.M., Gay L.J., Hulyk S.W., 

RA Villalon D.K., Muzny D.M., Sodergren E.J., Lu X., Gibbs R. A. , 

RA Fahey J., Helton E., Ketteman M. , Madan A., Rodrigues S., Sanchez A., 

RA Whiting M. , Madan A., Young A.C., Shevchenko Y., Bouffard G.G., 

RA Blakesley R.W., Touchman J.W., Green E.D., Dickson M.C, 

RA Rodriguez A.C, Grimwood J., Schmutz J., Myers R.M., 

RA Butterfield Y.S.N., Krzywinski M. I . , Skalska U., Smailus D.E., 

RA Schnerch A., Schein J.E., Jones S.J.M., Marra M.A. ; 

RT "Generation and initial analysis of more than 15,000 full-length 

RT human and mouse cDNA sequences. "; 

RL Proc. Natl. Acad. Sci. U.S.A. 99:16899-16903(2002). 
RN [4] 

RP PALMITOYLATION . 

RX MEDLINE=2 0 07 65 07; PubMed=10608 9 01 ; 

RA Tu Y., Popov S., Slaughter C, Ross E.M.; 

RT ,f Palmitoylation of a conserved cysteine in the regulator of G protein 

RT signaling (RGS) domain modulates the GTPase-activating activity of 

RT RGS 4 and RGS10."; 

RL J. Biol. Chem. 274:38260-38267(1999). 
RN [5] 

RP INHIBITION. 

RX MEDLINE=9 8 42152 7; PubMed=974 828 0 ; 

RA Wang J., Ducret A., Tu Y. „ Kozasa T., Aebersold R. f Ross E.M.; 

RT "RGSZ1, a Gz-selective RGS protein in brain. Structure, membrane 

RT association, regulation by Galphaz phosphorylation, and relationship 

RT to a Gz GTPase-activating protein subfamily."; 

RL J. Biol. Chem. 273:2 6014-26025(1998). 

CC -!- FUNCTION: INHIBITS SIGNAL TRANSDUCTION BY INCREASING THE GT PAS E 

CC ACTIVITY OF G PROTEIN ALPHA SUBUNITS THEREBY DRIVING THEM INTO 

CC THEIR INACTIVE GDP-BOUND FORM. ACTIVITY ON G ( Z ) -ALPHA IS INHIBITED 

CC BY PHOSPHORYLATION OF THE G-PROTEIN. ACTIVITY ON G ( Z ) -ALPHA AND 

cc G (I) -ALPHA- 1 IS INHIBITED BY PALMITOYLATION OF THE G-PROTEIN. 

CC -!- PTM: Either Cys-2 or Cys-12 or both are palmi toylated . 

CC -!- PTM: Phosphorylated by cyclic GMP-dependent protein kinase (By 

CC similarity) . 

CC -!- SIMILARITY: Contains 1 RGS domain. 

CC 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinf ormati cs and the EMBL outstation - 

CC the European Bioinf orma tics Institute. There are no restrictions on its 

CC use by non-profit institutions as long as its content is in no way 

CC modified and this statement is not removed. Usage by and for commercial 

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/ 



cc 
cc 

DR 

DR 

DR 

DR 

DR 

DR 

DR 

DR 

DR 

DR 

DR 

DR 

DR 

DR 

DR 

DR 

DR 

DR 

KW 

KW 

FT 

FT 

FT 

FT 

SQ 



or send an email to license@isb-sib . ch) 



EMBL; U27768; AAC50395.1; - 
EMBL; AF493928; AAM12642.1; 
EMBL; BC000737; AAH00737.1; 
EMBL; BC051869; AAH51869.1; 
PIR; S78221; S78221. 
HSSP; P49799; 1AGR . 
Genew; HGNC: 10000; RGS4 . 
MIM; 602516; - 



GO 
GO 
GO 
GO 



GO: 0005516 
GO: 0005096 
GO: 0000188. 
GO: 0008277; 



F: calmodulin binding; TAS . 
F : GTPase activator activity; TAS. 
P : inactivation of MAPK; TAS. 

P: regulation of G-protein coupled receptor pr 
Inter Pro; I PRO 0 0342; Regl_Gprotein . 
Pfam; PF00615; RGS; 1. 
PRINTS; PR01301; RGSPROTEIN. 
ProDom; PD001580; Regl_Gprotein ; 1. 
SMART; SM00315; RGS; 1. 
PROSITE; PS50132; RGS; 1. 

Signal transduction inhibitor; Lipoprotein; Palmitate; 
Phosphorylation . 

RGS . 

S-palmitoyl cysteine (Probable) . 
S-palmitoyl cysteine (Probable) . 
S-palmitoyl cysteine. 

7713F1F7496A698B CRC64; 



TAS . 



DOMAIN 
LIPID 
LIPID 
LIPID 
SEQUENCE 



62 
2 
12 
95 
205 AA; 



178 
2 
12 
95 

23255 



MW; 



Query Match 62.5%; Score 5; DB 1; 

Best Local Similarity 100.0%; Pred. No. 80; 
Matches 5; Conservative 0; Mismatches 

Qy 1 EVNLD 5 

I I I I I 

Db 126 EVNLD 130 



Length 205; 



0; Indels 



0; Gaps 



0; 



RESULT 25 
RGS4_MOUSE 

ID RGS4_MOUSE STANDARD; PRT ; 205 AA. 

AC 008899; Q99L30; 

DT 01-NOV-1997 (Rel. 35, Created) 

DT 01-NOV-1997 (Rel. 35, Last sequence update) 

DT 10-OCT-2003 (Rel. 42, Last annotation update) 

DE Regulator of G-protein signaling 4 ( RGS 4 ) ♦ 

GN RGS 4 . 

OS Mus mus cuius (Mouse) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleos tomi ; 

OC Mammalia; Eutheria; Rodentia; Sciurognathi ; Muridae; Murinae; Mus. 

OX NCBI_TaxID=10090 ; 

RN [1] 

RP SEQUENCE FROM N.A. 

RC STRAIN=C57BL/6; TISSUE=Subs tantia nigra; 

RX MEDLINE=98086343; PubMed=942 52 63 ; 

RA Nomoto S., Adachi K., Yang L.X., Hirata Y. , Muraguchi S . , Kiuchi K 

RT "Distribution of RGS 4 mRNA in mouse brain shown by in situ 

RT hybridization. " ; 



RL Biochem. Biophys . Res. Commun . 241:2 81-2 87(1997). 

RN [2] 

RP SEQUENCE FROM N . A. 

RC TISSUE=Breast tumor; 

RX MEDLINE=2238 8257; PubMed=124 7 7 932 ; 

RA Strausberg R.L., Feingold E.A., Grouse L.H., Derge J.G., 

RA Klausner R.D., Collins F.S., Wagner L., Shenmen CM., Schuler G.D., 

RA Altschul S.F., Zeeberg B., Buetow K.H., Schaefer C.F., Bhat N.K., 

RA Hopkins R.F., Jordan H., Moore T . , Max S.I., Wang J., Hsieh F. , 

RA Diatchenko L., Marusina K., Farmer A. A. , Rubin G.M. , Hong L . , 

RA Stapleton M. , Soares M.B., Bonaldo M.F., Casavant T.L., Scheetz T.E., 

RA Brownstein M.J., Usdin T.B., Toshiyuki S., Carninci P., Prange C, 

RA Raha S.S., Loquellano N.A., Peters G.J., Abramson R.D., Mullahy S.J., 

RA Bosak S.A., McEwan P. J., McKernan K.J., Malek J. A. , Gunaratne P.H., 

RA Richards S., Worley K.C., Hale S., Garcia A.M., Gay L.J., Hulyk S.W., 

RA Villalon D.K., Muzny D.M., Sodergren E.J., Lu X., Gibbs R. A. , 

RA Fahey J., Helton E., Ketteman M. , Madan A., Rodrigues S., Sanchez A. , 

RA Whiting M. , Madan A. , Young A.C., Shevchenko Y., Bouffard G.G., 

RA Blakesley R.W., Touchman J.W., Green E.D. f Dickson M.C., 

RA Rodriguez A.C., Grimwood J w Schmutz J. , Myers R.M., 

RA Butterfield Y.S.N. , Krzywinski M.I., Skalska U., Smailus D.E., 

RA Schnerch A., Schein J.E., Jones S.J.M., Marra M.A. ; 

RT "Generation and initial analysis of more than 15,000 full-length 

RT human and mouse cDNA sequences. "; 

RL Proc. Natl. Acad. Sci. U.S.A. 9 9:168 99-16903(2 002). 

CC -!- FUNCTION: INHIBITS SIGNAL TRANSDUCTION BY INCREASING THE GTPASE 
CC ACTIVITY OF G PROTEIN ALPHA SUBUNITS THEREBY DRIVING THEM INTO 

CC THEIR INACTIVE GDP-BOUND FORM. ACTIVITY ON G ( Z ) -ALPHA IS INHIBITED 

CC BY PHOSPHORYLATION OF THE G-PROTEIN. ACTIVITY ON G ( Z ) -ALPHA AND 

CC G (I) -ALPHA- 1 IS INHIBITED BY PALMI TO YLAT I ON OF THE G-PROTEIN (BY 

CC SIMILARITY) . 

CC -!- TISSUE SPECIFICITY: EXPRESSED AT HIGH LEVELS IN BRAIN, MODERATELY 
CC LOW LEVELS IN HEART, AND VERY LOW LEVELS IN LUNG, LIVER, AND 

CC SKELETAL MUSCLE. 

CC -!- PTM: Either Cys-2 or Cys-12 or both are palmitoylated (By 
CC similarity) . 

CC -!- PTM: Phosphorylated by cyclic GMP-dependent protein kinase (By 
CC similarity) . 

CC -!- SIMILARITY: Contains 1 RGS domain. 

CC 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinf ormatics and the EMBL outstation - 

CC the European Bioinf ormatics Institute. There are no restrictions on its 

CC use by non-profit institutions as long as its content is in no way 

CC modified and this statement is not removed. Usage by and for commercial 

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/ 

CC or send an email to license@isb-sib . ch) . 

CC 

DR EMBL; AB004315; BAA20400.1; -. 

DR EMBL; BC003882; AAH03882.1; 

DR HSSP; P49799; 1AGR. 

DR MGD; MGI: 108409; Rgs4 . 

DR InterPro; IPR000342; Regl_Gprotein . 

DR Pfam; PF00615; RGS; 1. 

DR PRINTS; PR01301; RGSPROTEIN . 

DR ProDom; PD001580; Regl_Gprotein ; 1. 

DR SMART; SM00315; RGS; 1. 



DR PROSITE; PS50132; RGS ; 1. 

KW Signal transduction inhibitor; Lipoprotein; Palmitate; 

KW Phosphorylation. 

FT DOMAIN 62 17 8 RGS . 

FT LIPID 2 2 S-palmitoyl cysteine (By similarity) . 

FT LIPID 12 12 S-palmitoyl cysteine (By similarity) . 

FT LIPID 95 95 S-palmitoyl cysteine (By similarity) . 

FT CONFLICT 162 162 R -> K (IN REF. 2) . 

SQ SEQUENCE 205 AA; 23288 MW; 5D7 95 8 17 11A1F67C CRC64 ; 

Query Match 62.5%; Score 5; DB 1; Length 2 05; 

Best Local Similarity 100.0%; Pred. No. 80; 

Matches 5; Conservative 0; Mismatches 0; Indels 0; Gaps 

Qy 1 EVNLD 5 

MM! 

Db 126 EVNLD 130 



RESULT 2 6 


RGS 4 


_RAT 


ID 


RGS 4 RAT STANDARD; PRT; 205 AA. 


AC 


P49799; 


DT 


01-OCT-1996 (Rel. 34, Created) 


DT 


01-OCT-1996 (Rel. 34, Last sequence update) 


DT 


10-OCT-2003 (Rel. 42, Last annotation update) 


DE 


Regulator of G-protein signaling 4 (RGS 4 ) (RGP4) . 


GN 


RGS 4 . 


OS 


Rattus norvegicus (Rat) . 


OC 


Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleos tomi ; 


OC 


Mammalia; Eutheria; Rodentia; Sciurognathi ; Muridae; Murinae ; Rattus. 


OX 


NCBI TaxID=10116; 


RN 


[1] 


RP 


SEQUENCE FROM N.A. 


RX 


MEDLINE-9 617 8 4 95; PubMed=8 6 0222 3 ; 


RA 


Druey K.M., Blumer K.J., Kang V.H., Kehrl J.H.; 


RT 


"Inhibition of G-protein-mediated MAP kinase activation by a new 


RT 


mammalian gene family."; 


RL 


Nature 37 9:742-746(1996). 


RN 


[2] 


RP 


SEQUENCE FROM N.A. 


RC 


STRAIN=Sprague-Dawley; 


RA 


Zhou M.-Y., Gomez-Sanchez C.E., Gomez-Sanchez E.P.; 


RT 


"The complete cDNA sequence analysis of the rat RGS4."; 


RL 


Submitted (DEC-1998) to the EMBL/ GenBank/ DDB J databases. 


RN 


[3] 


RP 


SEQUENCE OF 93-159 FROM N.A. 


RC 


TISSUE^Brain; 


RX 


MEDLINE=9 6 14 0 64 5; PubMed=8 5 48815; 


RA 


Koelle M.R., Horvitz H.R.; 


RT 


"EGL-10 regulates G protein signaling in the C. elegans nervous 


RT 


system and shares a conserved domain with many mammalian proteins. "; 


RL 


Cell 84:115-125(1996). 


RN 


[4] 


RP 


PHOSPHORYLATION. 


RX 


MEDLINE=2 01672 19; PubMed=l 07 02309 ; 


RA 


Pedram A., Razandi M. , Kehrl J. r Levin E.R.; 



RT "Natriuretic peptides inhibit G protein activation. Mediation through 

RT cross-talk between cyclic GMP-dependent protein kinase and regulators 

RT of G protein-signaling proteins."; 

RL J. Biol. Chem. 2 75:7365-7372(2 00 0). 
RN [5] 

RP INHIBITION. 

RX MEDLINE= 9 8 016286; PubMed=93 5 319 6; 

RA Tu Y. , Wang J., Ross E.M.; 

RT "Inhibition of brain Gz GAP and other RGS proteins by palmi toylation 

RT of G protein alpha subunits. "; 

RL Science 27 8:1132-1135(1997). 

RN [6] 

RP X-RAY CRYSTALLOGRAPHY (2.8 ANGSTROMS) OF COMPLEX WITH G (I) -ALPHA. 

RX MEDLINE=9 72 62 066; PubMed=91 0 84 8 0 ; 

RA Tesmer J.J.G., Berman D.M., Gilman A.G., Sprang S.R.; 

RT "Structure of RGS 4 bound to AlF4-activated G(i alphal): 

RT stabilization of the transition state for GTP hydrolysis . 11 ; 

RL Cell 89:251-261(1997). 

CC -!- FUNCTION: INHIBITS SIGNAL TRANSDUCTION BY INCREASING THE GTPASE 

CC ACTIVITY OF G PROTEIN ALPHA SUBUNITS THEREBY DRIVING THEM INTO 

CC THEIR INACTIVE GDP— BOUND FORM . ACTIVITY ON G(Z) -ALPHA IS INHIBITED 

CC BY PHOSPHORYLATION OF THE G-PROTEIN . ACTIVITY ON G ( Z ) -ALPHA AND 

CC G (I) -ALPHA- 1 IS INHIBITED BY PALMI TOYLATION OF THE G-PROTEIN. 

CC -!- PTM: Either Cys-2 or Cys-12 or both are palmitoylated (By 

CC similarity) . 

CC PTM: Phosphorylated by cyclic GMP-dependent protein kinase. 

CC -!- SIMILARITY: Contains 1 RGS domain. 

CC 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinf ormatics and the EMBL outstation - 

CC the European Bioinf ormatics Institute. There are no restrictions on its 

CC use by non-profit institutions as long as its content is in no way 

CC modified and this statement is not removed. Usage by and for commercial 

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/ 

CC or send an email to license@isb-sib . ch) . 

CC 

DR EMBL; U27767; AAC52440.1; -. 

DR EMBL; AF117211; AAD12065.1; -. 

DR EMBL; U32327; AAC52367.1; 

DR PDB; 1AGR; 16- JUN-97 . 

DR PDB; 1EZT; 15-JAN-01. 

DR PDB; 1EZY; 15- JAN-01 . 

DR InterPro; IPR000342; Regl_Gprotein . 

DR Pfam; PF00615; RGS; 1. 

DR PRINTS; PR01301; RGSPROTEIN . 

DR ProDom; PD001580; Regl_Gprotein ; 1. 

DR SMART; SM0 0315; RGS; 1. 

DR PROSITE; PS50132; RGS; 1. 

KW Signal transduction inhibitor; Lipoprotein; Palmitate; 

KW Phosphorylation; 3D-s tructure . 

FT DOMAIN 62 178 RGS. 

FT LIPID 2 2 S-palmitoyl cysteine (By similarity) . 

FT LIPID 12 12 S-palmitoyl cysteine (By similarity) . 

FT LIPID 95 95 S-palmitoyl cysteine (By similarity) . 

FT HELIX 53 59 

FT TURN 60 61 

FT HELIX 63 68 



FT HELIX 70 82 

FT TURN 83 84 

FT TURN 8 6 8 6 

FT HELIX 87 100 

FT TURN 104 106 

FT HELIX 107 118 

FT TURN 119 119 

FT TURN 121 122 

FT TURN 124 125 

FT HELIX 131 140 

FT TURN 141 142 

FT TURN 146 149 

FT HELIX 150 162 

FT TURN 163 163 

FT HELIX 164 168 

FT TURN 169 170 

FT HELIX 172 175 

FT TURN 176 176 

SQ SEQUENCE 205 AA; 23248 MW; 9647C0EC909D0F6F CRC64; 

Query Match 62.5%; Score 5; DB 1; Length 205; 

Best Local Similarity 100.0%; Pred. No. 80; 

Matches 5; Conservative 0; Mismatches 0; Indels 0; Gaps 

Qy 1 EVNLD 5 

Mill 

Db 12 6 EVNLD 13 0 



RESULT 27 






NFNB 


_ECOLI 






ID 


NFNB ECOLI STANDARD; PRT; 217 AA. 






AC 


P38489; P19575; 






DT 


01-FEB-1991 (Rel. 17, Created) 






DT 


01-OCT-1994 (Rel. 30, Last sequence update) 






DT 


10-OCT-2003 (Rel. 42, Last annotation update) 






DE 


Oxygen-insensitive NAD ( P ) H nitroreductase (EC 


1.-.-.- 


-) ( FMN-dependent 


DE 


nitroreductase) ( Dihydropteridine reductase) 


(EC 1.6. 


99.7) . 


GN 


NFNB OR NFSI OR NFSB OR NTR OR DPRA OR B0578. 






OS 


Escherichia coli . 






OC 


Bacteria; Proteobacteria ; Gammaproteobacteria 


; Enterobacteriales ; 


oc 


Enterobacteriaceae ; Escherichia . 






OX 


NCBI TaxID=562; 






RN 


[1] 






RP 


SEQUENCE FROM N.A. 






RC 


STRAIN=K12 / C600; 






RA 


Zenno S., Koike H., Tanokura M. , Saigo Re- 






RL 


submitted (SEP-1994) to the EMBL/ GenBank/DDBJ 


databases . 


RN 


[2] 






RP 


SEQUENCE FROM N.A. 






RC 


STRAIN=B; 






RX 


MEDLINE=951132 94; PubMed=78 138 8 9 ; 






RA 


Michael N.P., Brehm J.K., Anlezark G.M., Minton N.P.; 




RT 


"Physical characterisation of the Escherichia 


coli B 


gene encoding 


RT 


nitroreductase and its over-expression in Escherichia 


coli K12."; 


RL 


FEMS Microbiol. Lett. 124:195-2 02(1994). 






RN 


[3] 







RP SEQUENCE FROM N.A. 

RC STRAIN-K12 / MG1655; 

RX MEDL INE= 97426617; PubMed= 927 8503; 

RA Blattner F.R., Plunkett G. Ill, Bloch C.A. , Perna N.T., Burland V., 

RA Riley M. , Collado-Vides J. , Glasner J.D., Rode C.K., Mayhew G.F., 

RA Gregor J., Davis N.W., Kirkpatrick H.A., Goeden M.A. , Rose D.J., 

RA Mau B. , Shao Y. ; 

RT "The complete genome sequence of Escherichia coli K-12."; 

RL Science 277:1453-1474(1997). 
RN [4] 

RP SEQUENCE FROM N.A. 

RA Chung E . , Allen E., Araujo R. , Aparicio A., Davis K., Duncan M. , 

RA Feder spiel N., Hyman R. , Kalman S., Komp C, Kurdi O., Lew H., Lin D . , 

RA Namath A. , Oefner P., Roberts D. , Schramm S., Davis R.W.; 

RL Submitted (JAN-1997) to the EMBL/ GenBank/DDB J databases. 

RN [5] 

RP SEQUENCE FROM N.A. 

RC STRAIN=K12; 

RX MEDLINE-97 0612 02 ; PubMed=8 9 052 32 ; 

RA Oshima T., Aiba H., Baba T . , Fujita K., Hayashi K., Hon j o A., 

RA Ikemoto K. , Inada T., Itoh T . , Kajihara M. , Kanai K., Kashimoto K. , 

RA Kimura S., Kitagawa M. , Makino K. , Masuda S . , Miki T . , Mizobuchi K., 

RA Mori H., Motomura K. , Nakamura Y. r Nashimoto H., Nishio Y. , Saito N., 

RA Sampei G. , Seki Y., Tagami H., Takemoto K. , Wada C, Yamamoto Y., 

RA Yano M. , Horiuchi T.; 

RT "A 718-kb DNA sequence of the Escherichia coli K-12 genome 

RT corresponding to the 12.7-2 8.0 min region on the linkage map."; 

RL DNA Res. 3:137-155(1996). 

RN [6] 

RP CHARACTERIZATION, AND SEQUENCE OF 1-31 AND 139-180. 

RC STRAIN=B; 

RX MEDLINE=93112106; PubMed=14 72 094 ; 

RA Anlezark G.M. , Melton R.G., Sherwood R.F., Coles B., Friedlos F. , 

RA Knox R.J. ; 

RT "The bioactivation of 5- ( aziridin- 1-yl ) -2 , 4-dinitrobenzamide 

RT (CB1954) — I. Purification and properties of a nitroreductase enzyme 

RT from Escherichia coli — a potential enzyme for antibody-directed 

RT enzyme prodrug therapy (ADEPT). 1 *; 

RL Biochem. Pharmacol. 4 4:2289-2295(1992). 

RN [7] 

RP SEQUENCE OF 1-12. 

RC STRAIN=K12 / EMG2 ; 

RX MEDLINE=97 4 4 397 5; PubMed=92 98 64 6 ; 

RA Link A. J., Robison K., Church G.M. ; 

RT "Comparing the predicted and observed properties of proteins encoded 

RT in the genome of Escherichia coli K-12."; 

RL Electrophoresis 18:1259-1313(1997) . 

RN [8] 

RP SEQUENCE OF 1-20, AND CHARACTERIZATION AS A DIHYDROPTERI DINE REDUCT . 

RX MEDLINE=8 907 6221; PubMed=3 060113; 

RA Vasudevan S.G., Shaw D.C., Armarego W.L.F.; 

RT "Dihydropteridine reductase from Escherichia coli . " ; 

RL Biochem. J. 255:581-588(1988). 

RN [9] 

RP X-RAY CRYSTALLOGRAPHY (2.06 ANGSTROMS). 

RC STRAIN=B ; 

RX MEDLINE=2 047 6666; PubMed=l 1 02 02 7 6 ; 



RA Parkinson G.N., Skelly J.V., Neidle S.; 

RT "Crystal structure of FMN- dependent nitroreductase from Escherichia 

RT coli B: a prodrug-activating enzyme."; 

RL J. Med. Chem. 43:3624-3631(2000). 

CC -!- FUNCTION: REDUCTION OF A VARIETY OF NIT RO AROMAT I C COMPOUNDS USING 

CC NADH (AND TO LESSER EXTENT NADPH) AS SOURCE OF REDUCING 

CC EQUIVALENTS; TWO ELECTRONS ARE TRANSFERRED. CAPABLE OF 

CC NITROFURAZONE, QUINONES AND THE ANTI-TUMOR AGENT CB1954 (5- 

CC (AZIRIDIN— 1-YL) -2, 4 -DINITROBENZAMIDE ) . THE REDUCTION OF CB1954 

CC RESULTS IN THE GENERATION OF CYTOTOXIC SPECIES. 

CC -!- CATALYTIC ACTIVITY: NAD ( P ) H + 6, 7 -dihydropteridine = NAD(P) ( + ) + 

CC 5, 6, 7 , 8-tetrahydropteridine . 

CC -!- COFACTOR: FMN. 

CC -!- SUBUNIT: Monomer or homodimer. 

CC -!- SIMILARITY: Belongs to the nitroreductase family. 

CC 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinf ormatics and the EMBL outstation - 

CC the European Bioinf ormatics Institute. There are no restrictions on its 

CC use by non-profit institutions as long as its content is in no way 

CC modified and this statement is not removed. Usage by and for commercial 

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/ 

CC or send an email to license@isb-sib . ch) . 

CC 

DR 
DR 
DR 
DR 
DR 
DR 
DR 
DR 
DR 
DR 
DR 
DR 

DR SWI S S - 2 D PAGE ; P38489; COLI. 

DR EcoGene; EG20151; nfnB. 

DR InterPro; IPR000415; Nitroreductase. 

DR Pfam; PF00881; Nitroreductase; 1. 

KW Oxidoreductase; NAD; NADP; Flavoprotein ; FMN; Complete proteome; 

KW 3D-structure . 



EMBL; 


D25414; 


BAA05004 . 


1; 


EMBL; 


U07860; 


AAC43263 . 


l; -. 


EMBL; 


AE000163; AAC73679.1; - 


EMBL ; 


U82598; 


AAB40776. 


1; 


EMBL; 


D90700; 


BAA35218 . 


1; -. 


EMBL; 


A23284; 


CAA01666. 


1; 


PIR; 


167685; 


167685. 




PIR; 


S01818; 


S01818 . 




PDB; 


1DS7; 12 


-JUL-00 . 




PDB; 


1ICR; 23 


-MAY- 0 1 . 




PDB; 


1ICU; 23 


-MAY- 01 . 




PDB; 


1ICV; 23 


-MAY- 01 . 





FT 


NP_BIND 


153 


158 


NAD OR NADP (BY SIMILARITY) 


FT 


CONFLICT 


5 


5 


S -> C (IN REF. 8) . 


FT 


CONFLICT 


10 


12 


RHS -> CIV (IN REF. 8) . 


FT 


CONFLICT 


19 


19 


S -> M (IN REF. 8) . 


FT 


CONFLICT 


21 


21 


MISSING (IN REF. 6) . 


FT 


CONFLICT 


28 


28 


E -> D (IN REF. 6) . 


FT 


CONFLICT 


180 


180 


E -> I (IN REF. 6) . 


FT 


HELIX 


3 


9 




FT 


STRAND 


12 


12 




FT 


STRAND 


16 


16 




FT 


TURN 


18 


19 




FT 


HELIX 


24 


35 




FT 


TURN 


36 


36 




FT 


HELIX 


40 


42 




FT 


STRAND 


46 


51 





FT 


HELIX 


54 


61 


FT 


TURN 


62 


63 


FT 


HELIX 


66 


71 


FT 


HELIX 


72 


77 


FT 


STRAND 


80 


87 


FT 


HELIX 


92 


104 


FT 


TURN 


105 


106 


FT 


HELIX 


111 


129 


FT 


TURN 


130 


132 


FT 


HELIX 


135 


157 


FT 


TURN 


158 


158 


FT 


STRAND 


160 


161 


FT 


STRAND 


164 


164 


FT 


HELIX 


169 


175 


FT 


TURN 


176 


111 


FT 


HELIX 


179 


181 


FT 


TURN 


182 

*_L. Jl_f 


182 


FT 


STRAND 


183 


192 


FT 


STRAND 


194 


194 


FT 


TURN 


196 


197 


FT 


HELIX 


199 


201 


FT 


TURN 


202 


202 


FT 


HELIX 


210 


213 


FT 


STRAND 


214 


217 


SQ 


SEQUENCE 


217 AA; 


239 



5 MW; A5 16CEFC3D4 6AEAC CRC64 ; 

Query Match 62.5%; Score 5; DB 1; Length 217; 

Best Local Similarity 100.0%; Pred. No. 83; 

Matches 5; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 

Qy 4 LDAEF 8 

I I I I I 

Db 172 LDAEF 176 



RESULT 2 8 
NFNB_SALTY 

ID NFNB_SALTY STANDARD; PRT; 217 AA. 

AC P15888; 

DT 01-APR-1990 (Rel. 14, Created) 

DT 01-APR-1990 (Rel. 14, Last sequence update) 

DT 28-FEB-2003 (Rel. 41, Last annotation update) 

DE Oxygen-insensitive NAD ( P ) H nitroreductase (EC 1.-.-.-). 

GN NFNB OR NFSI OR STM0578 . 

OS Salmonella typhimurium. 

OC Bacteria; Proteobacteria; Gammaproteobacteria ; Enterobacteriales ; 

OC Enterobacteriaceae ; Salmonella. 

OX NCBI_TaxID=602 ; 

RN [1] 

RP SEQUENCE FROM N . A. 

RC STRAIN=TA1538 ; 

RX MEDLINE=9 0 192100; PubMed=2 179862; 

RA Watanabe M. , Ishidate M. , Nohmi T.; 

RT "Nucleotide sequence of Salmonella typhimurium nitroreductase gene."; 

RL Nucleic Acids Res. 18:1059-1059(1990). 

RN [2] 

RP SEQUENCE FROM N . A. 



Db 



18 EVNLD 22 



RESULT 13 
T37632 

hypothetical protein SPACl3F5.07c - fission yeast (Schizosaccharomyces pombe) 
C; Species: Schizosaccharomyces pombe 

C;Date: 03-Dec-1999 #sequence_revision 03-Dec-1999 #text_change 04-Mar-2000 
C; Accession: T37 632 

R;Brown, D.; Churcher, CM.; Barrell, B.G.; Rajandream, M.A. ; Wood, V. 
submitted to the EMBL Data Library, September 1997 
A; Reference number: Z21733 
A;Accession: T37632 

A; Status: preliminary; translated from GB/EMBL/DDBJ 
A;Molecule type: DNA 
A; Residues: 1-105 <BRO> 

A; Cross-references : EMBL: Z 99 091; PIDN : CAB117 7 0 . 1; GSPDB : GN00066 ; 
SPDB: SPAC13F5 . 07c 

A; Experimental source: strain 972h-; cosmid cl3F5 
C ; Genetics : 

A;Gene: SPDB: SPAC13F5 . 07c 
A;Map position: 1 

C; Super family : Schizosaccharomyces pombe hypothetical protein SPAC13F5.07c 

Query Match 62.5%; Score 5; DB 2; Length 105; 

Best Local Similarity 100.0%; Pred. No. 83; 

Matches 5; Conservative 0; Mismatches 0; Indels 0; Gaps 0 

Qy 4 LDAEF 8 

I I I IE 

Db 91 LDAEF 95 



RESULT 14 
S65003 

hypothetical protein YLRl54c - yeast ( Saccharomyces cerevisiae) 
N; Alternate names: hypothetical protein L3341 
C; Species: Saccharomyces cerevisiae 

C;Date: Ol-Aug-1995 #sequence_revision 24-May-1996 #text_change 19-Apr-2002 
C'Accession: S65003; S66535 

R;Rieger, M. ; Mueller-Auer, S.; Brueckner, M. 
submitted to the Protein Sequence Database, May 1996 
A; Reference number: S64987 
A;Accession: S65003 
A;Molecule type: DNA 
A; Residues: 1-110 <RIE> 

A; Cross-references : EMBL: Z 7332 6; NID : gl3 60 58 7 ; PIDN : CAA9772 6 . 1; PID: gl36058 8; 
MIPS:YLR154c 

A; Experimental source: strain S288C 
R;van den Berg, M.A. ; Steensma, H.Y. 
Eur. J. Biochem. 231, 704-713, 1995 

A; Title: ACS 2 , a Saccharomyces cerevisiae gene encoding acetyl-coenzyme A 

synthetase, essential for growth on glucose. 

A; Reference number: S66534; MUID : 95377302 ; PMID: 7649171 

A;Accession: S66535 

A; Status: translation not shown 

A;Molecule type: DNA 



A; Residues: 1-110 <VAN> 

A; Cross- references : EMBL: S7 9456; N1D: gll68 013; PIDN : AAB3514 4 . 1; PID: gl6 83551 
C ; Genetics : 

A; Cross-references : SGD: S0004144 
A; Map position: 12R 
A;Note: YLR154c 

C; Superf amily : Saccharomyces hypothetical protein YLR154c 

Query Match 62.5%; Score 5; DB 2; Length 110; 

Best Local Similarity 100.0%; Pred. No. 86; 

Matches 5; Conservative 0; Mismatches 0; Tndels 0; Gaps 0; 

Qy 2 VNLDA 6 

Mill 

Db 6 VNLDA 10 



RESULT 15 
F83714 

holo- (acyl carrier protein) synthase BH0518 [imported] - Bacillus halodurans 
(strain C-125) 

C; Species: Bacillus halodurans 

C;Date: 01-Dec-2000 #sequence_revision 01-Dec-2000 #text_change 15-Jun-2001 
C; Accession: F83714 

R;Takami, H.; Nakasone, K. ; Takaki, Y . ; Maeno, G.; Sasaki, R. ; Masui, N.; Fuji, 
F.; Hirama, C; Nakamura, Y.; Ogasawara, N . ; Kuhara, S.; Horikoshi, K. 
Nucleic Acids Res. 28, 4317-4331, 2000 

A; Title: Complete genome sequence of the alkaliphilic bacterium Bacillus 

halodurans and genomic sequence comparison with Bacillus subtilis. 

A;Reference number: A83650; MUID : 2 0512 582 ; PMID : 1 1058 132 

A;Accession : F8 3714 

A; Status: preliminary 

A; Molecule type: DNA 

A; Residues: 1-119 <STO> 

A; Cross-references : GB: AP0 01508 ; GB : BA000 004 ; NID:gl0172890; PIDN : BAB04237 . 1; 
GSPDB: GN00137 

A; Experimental source: strain C-125 

C ; Genetics : 

A; Gene: BH0518 

C ; Superf amily : holo-ACP synthase 

Query Match 62.5%; Score 5; DB 2; Length 119; 

Best Local Similarity 100.0%; Pred. No. 92; 

Matches 5; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 

Qy 2 VNLDA 6 

I I I I I 

Db 91 VNLDA 95 



RESULT 16 
T18655 

hypothetical protein B0035.4 - Caenorhabditis elegans 
C; Species: Caenorhabditis elegans 

C;Date: 15-Oct-1999 #sequence_revision 15-Oct-1999 #text_change 21-Jul-2000 
C;Accession: T18655 
R;White, S. 



submitted to the EMBL Data Library, May 1996 
A;Reference number: Z19002 
A;Accession: T18655 

A; Status : preliminary; translated from GB/EMBL/DDBJ 
A;Molecule type: DNA 
A; Residues: 1-126 <WIL> 

A;Cross-references: EMBL : Z73 102 ; PIDN : CAA97 4 10.1; GSPDB : GN00022 ; CESP:B0035.4 

A; Experimental source: clone B0035 

C ; Genetics : 

A; Gene : CESP : BO 035 . 4 

A;Map position: 4 

A;Introns: 29/3; 73/3 

C; Superf amily : Arabidopsis thaliana hypothetical protein F22013.29 

Query Match 62.5%; Score 5; DB 2; Length 126; 

Best Local Similarity 100.0%; Pred. No. 97; 

Matches 5; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 

Qy 3 NLDAE 7 

I I II I 

Db 121 NLDAE 12 5 



RESULT 17 
AH1425 

hypothetical secreted protein lmo2809 [imported] - Listeria monocytogenes 
(strain EGD-e) 

C; Species: Listeria monocytogenes 

C;Date: 27-Nov-2001 #sequence_revision 27-Nov-2001 #text_change 27-Nov-2001 
C; Accession: AH1425 

R;Glaser, P.; Frangeul, L.; Buchrieser, C; Amend, A.; Baquero, F . ; Berche, P.; 
Bloecker, H.; Brandt, P.; Chakraborty, T.; Charbit, A.; Chetouani, F.; Couve, 
E. ; de Daruvar, A.; Dehoux, P.; Domann, E.; Dominguez-Bernal , G.; Duchaud, E . ; 
Durand, L.; Dussurget, O.; Entian, K.D.; Fsihi, H.; Garcia-Del Portillo, F . ; 
Garrido, P.; Gautier, L.; Goebel, W.; Gomez-Lopez, N . ; Hain, T . ; Hauf, J.; 
Jackson, D. ; Jones, L.M.; Karst, U. 
Science 294, 849-852, 2001 

A;Authors: Kreft, J.; Kuhn, M. ; Kunst, F . ; Kurapkat, G . ; Madueno, E.; 

Maitournam, A.; Mata Vicente, J.; Ng, E.; Nordsiek, G.; Novella, S.; de Pablos, 

B . ; Perez-Diaz, J.C.; Remmel, B.; Rose, M. ; Rusniok, C; Schlueter, T.; Simoes, 

N.; Tierrez, A.; Vazquez-Boland, J. A.; Voss, H.; Wehland, J.; Cossart, P. 

A; Title: Comparative genomics of Listeria species. 

A; Reference number: AB1077; MUID : 2 15372 7 9 ; PMID : 1167 9669 

A; Accession : AH1425 

A; Status : preliminary 

A;Molecule type: DNA 

A; Residues: 1-126 <GLA> 

A; Cross-references : GB : NC_0 0 32 1 0 ; PIDN : CAD0 1022 . 1; PID : gl 64 12 3 0 9 ; GSPDB : GN0 0 177 

A; Experimental source: strain EGD-e 

C ; Genetics : 

A; Gene: lmo2 8 09 

Query Match 62.5%; Score 5; DB 2 ; Length 126; 

Best Local Similarity 100.0%; Pred. No. 97; 

Matches 5; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 



Qy 



3 NLDAE 7 



Db 53 NLDAE 57 



RESULT 18 
H69419 

arsenate reductase (arsC) homolog - Archaeoglobus fulgidus 
C; Species: Archaeoglobus fulgidus 

C;Date: 05-Dec-1997 #sequence_revision 05-Dec-1997 #text_change 22-Oct-1999 
C;Accession: H69419 ~~ 

R;Klenk, H.P.; Clayton, R.A.; Tomb, J.F.; White, O.; Nelson, K.E.; Ketchum, 
K.A.; Dodson, R.J.; Gwinn, M. ; Hickey , E . K . ; Peterson, J.D.; Richardson, D.L.; 
Kerlavage, A.R.; Graham, D.E.; Kyrpides, N.C.; Fleischmann, R.D.; Quackenbush, 
J.; Lee, N.H.; Sutton, G.G.; Gill, S.; Kirkness, E.F.; Dougherty, B.A.; McKenny, 
K.; Adams, M.D.; Loftus, B . ; Peterson, S.; Reich, C.I.; McNeil, L.K.; Badger, 
J.H.; Glodek, A.; Zhou, L.; Overbeek, R. ; Gocayne, J.D.; Weidman, J.F.; 
McDonald, L. 

Nature 390, 364-370, 1997 

A;Authors: Utterback, T. ; Cotton, M.D.; Spriggs, T.; Artiach, P.; Kaine, B.P.; 
Sykes, S.M.; Sadow, P.W.; D'Andrea, K.P.; Bowman, C; Fujii, C; Garland, S.A.; 
Mason, T.M.; Olsen, G.J.; Fraser, CM.; Smith, H.O.; Woese, C.R.; Venter, J.C. 
A; Title: The complete genome sequence of the hyperthermophilic, sul fate-reducing 
archaeon Archaeoglobus fulgidus . 

A;Reference number: A69250; MUID : 98 04 9343 ; PMID: 9389475 
A; Access ion: H69419 

A; Status: preliminary; nucleic acid sequence not shown; translation not shown 
A;Molecule type: DNA 
A; Residues: 1-135 <KLE> 

A; Cross -references : GB: AE001010; GB: AE00 07 82 ; NID : g2 68 9333 ; PIDN : AAB8 9 8 85 . 1; 
PID: g2 64 92 14 ; TIGR: AF13 61 

C; Superf amily : protein-tyrosine-phosphatase, low molecular weight 

Query Match 62.5%; Score 5; DB 2; Length 135; 

Best Local Similarity 100.0%; Pred. No. le+02; 

Matches 5; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 

Qy 1 EVNLD 5 

Mil! 

Db 65 EVNLD 69 



RESULT 19 
H72633 

hypothetical protein APE1527 - Aeropyrum pernix (strain Kl) 
C; Species: Aeropyrum pernix 

C;Date: 20-Aug-1999 #sequence_revision 20-Aug-1999 #text_change 20-Jun-2000 
C; Accession: H72 633 

R;Kawarabayasi, Y.; Hino, Y.; Horikawa, H.; Yamazaki, S . ; Haikawa, Y.; Jin-no, 
K.; Takahashi, M. ; Sekine, M. ; Baba, S.; Ankai, A.; Kosugi, H . ; Hosoyama, A. ; 
Fukui, S.; Nagai, Y.; Nishijima, K. ; Nakazawa, H.; Takamiya, M. ; Masuda, S.; 
Funahashi, T.; Tanaka, T . ; Kudoh, Y. ; Yamazaki, J.; Kushida, N.; Oguchi, A.; 
Aoki, K.; Kubota, K . ; Nakamura, Y „ ; Nomura, N.; Sako, Y.; Kikuchi, H. 
DNA Res. 6, 83-101, 1999 

A;Title: Complete genome sequence of an aerobic hyper-thermophilic Crenarchaeon, 
Aeropyrum pernix Kl . 

A;Reference number: A72450; MUID : 9 931 033 9 ; PMID : 103 82966 
A;Accession: H72633 



A; Status: preliminary 
A; Molecule type: DNA 
A; Residues: 1-136 <KAW> 

A;Cross-references: DDB J : APO 00061 ; NID : g5 1 04 82 1 ; PIDN : BAA8 052 6 . 1 ; PID:g5105212 

A; Experimental source: strain Kl 

C ; Genetics : 

A; Gene: APE1527 

C; Superf amily : Aeropyrum pernix hypothetical protein APE1527 

Query Match 62.5%; Score 5; DB 2; Length 136; 

Best Local Similarity 100.0%; Pred. No. le+02; 

Matches 5; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 

Qy 3 NLDAE 7 

I I I I I 

Db 2 9 NLDAE 3 3 



RESULT 2 0 
S77399 

urease accessory protein E - Synechocys tis sp. (strain PCC 6803) 
N;Alternate names: protein slrl219 
C; Species: Synechocys tis sp. 
A;Variety: PCC 6803 

C;Date: 25-Apr-1997 #sequence_revision 25-Apr-1997 #text_change 28-Jul-2003 
C; Accession : S7739 9 

R;Kaneko, T . ; Sato, S.; Kotani, H.; Tanaka, A.; Asamizu, E.; Nakamura, Y.; 
Miyajima, N.; Hirosawa, M. ; Sugiura, M. ; Sasamoto, S.; Kimura, T.; Hosouchi, T. 
Matsuno, A.; Muraki, A.; Nakazaki, N.; Naruo, K.; Okumura, S.; Shimpo, S.; 
Takeuchi, C; Wada, T. ; Watanabe, A.; Yamada, M. ; Yasuda, M. ; Tabata, S. 
DNA Res. 3, 109-136, 1996 

A;Title: Sequence analysis of the genome of the unicellular cyanobacterium 

Synechocystis sp. PCC6803. II. Sequence determination of the entire genome and 

assignment of potential protein-coding regions. 

A; Reference number: S74322; MUID : 970 612 01 ; PMID: 8905231 

A; Accession: S773 99 

A; Status: nucleic acid sequence not shown; translation not shown 
A;Molecule type: DNA 
A; Residues: 1-142 <KAN> 

A; Cross-references : EMBL: D9 0906; GB : ABO 01339; NID : gl6524 92 ; PIDN : BAA17502 . 1; 
PID: dl 01 82 35; PID: gl 6525 81 

A; Note: the nucleotide sequence was submitted to the EMBL Data Library, June 
1996 

C; Genetics : 
A; Gene: ureE 

C; Superf amily : urease accessory protein (nickel metallochaperone ) , UreE type 

Query Match 62.5%; Score 5; DB 2; Length 142; 

Best Local Similarity 100.0%; Pred. No. 1. le+02; 

Matches 5; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 

Qy 1 EVNLD 5 

I I I I I 

Db 100 EVNLD 104 



RESULT 21 



B83069 

hypothetical protein PA4610 [imported] - Pseudomonas aeruginosa (strain PAOl) 
C; Species: Pseudomonas aeruginosa 

C;Date: 15-Sep~2000 #sequence_revision 15-Sep-2000 #text_change 31-Dec-2000 
C; Accession: B83 069 

R;Stover, C.K.; Pham, X.Q.; Erwin, A.L.; Mizoguchi, S.D.; Warrener, P.; Hickey 
M.J.; Brinkman, F.S.L.; Hufnagle, W.O.; Kowalik, D.J.; Lagrou, M. ; Garber, R.L 
Goltry, L.; Tolentino, E.; Westbrook-Wadman, S.; Yuan, Y. ; Brody, L.L.; Coulte 
S.N.; Folger, K.R.; Kas, A.; Larbig, K. ; Lim, R.M.; Smith, K.A. ; Spencer, D.H. 
Wong, G.K.S.; Wu, Z.; Paulsen, I . T . ; Reizer, J.; Saier, M.H.; Hancock, R.E.W.; 
Lory, S.; Olson, M.V. 
Nature 406, 959-964, 2000 

A;Title: Complete genome sequence of Pseudomonas aeruginosa PAOl, an 
opportunistic pathogen. 

A;Reference number: A82950; MUID : 20437337 ; PMID : 10984043 
A; Access ion: B8 3069 
A; Status: preliminary 
A;Molecule type: DNA 
A; Residues: 1-144 <STO> 

A; Cross-references : GB:AE0 04 8 75; GB:AE004 0 91; NlD:g9 950857; PIDN : AAG07 9 9 8 . 1; 

GSPDB:GN00131; PASP: PA4610 

A; Experimental source: strain PAOl 

C ; Genetics : 

A; Gene: PA4 610 

Query Match 62.5%; Score 5; DB 2; Length 144; 

Best Local Similarity 100.0%; Pred. No. l.le+02; 

Matches 5; Conservative 0; Mismatches 0; Indels 0; Gaps 0 

Qy 3 NLDAE 7 

Mill 

Db 108 NLDAE 112 



RESULT 22 
S75581 

hypothetical protein sll0802 - Synechocystis sp . (strain PCC 6803) 
C; Species: Synechocystis sp. 
A;Variety: PCC 6803 

C;Date: 25-Apr-1997 #sequence_revision 25-Apr-1997 #text_change 20-Jun-2000 
C;Accession: S75581 

R;Kaneko, T.; Sato, S.; Kotani, H.; Tanaka, A.; Asamizu, E.; Nakamura, Y.; 
Miyajima, N.; Hirosawa, M. ; Sugiura, M. ; Sasamoto, S.; Kimura, T . ; Hosouchi, T 
Matsuno, A.; Muraki, A.; Nakazaki, N.; Naruo, K. ; Okumura, S.; Shimpo, S.; 
Takeuchi, C; Wada, T.; Watanabe, A.; Yamada, M. ; Yasuda, M. ; Tabata, S. 
DNA Res. 3, 109-136, 1996 

A;Title: Sequence analysis of the genome of the unicellular cyanobacterium 
Synechocystis sp . PCC6803. II. Sequence determination of the entire genome and 
assignment of potential protein-coding regions . 
A;Reference number: S74322; MUID : 97 0 612 0 1 ; PMID: 8905231 
A;Accession: S75581 

A; Status: nucleic acid sequence not shown; translation not shown 
A; Molecule type: DNA 
A;Residues: 1-147 <KAN> 

A; Cross-references : EMBL:D90 911; GB:AB001339; NID : gl653 083 ; PIDN : BAAl 8 14 2 . 1; 
PID: gl653226 



A; Note: the nucleotide sequence was submitted to the EMBL Data Library, June 
1996 

C; Super family : hypothetical protein slrl203 



Query Match 62.5%; Score 5; DB 2; Length 147; 

Best Local Similarity 100.0%; Pred. No. l.le+02; 

Matches 5; Conservative 0; Mismatches 0; Indels 0; Gaps 

Qy 4 LDAEF 8 

I I I I I 

Db 14 0 LDAEF 14 4 



RESULT 23 
AH1166 

ribose 5-phosphate isomerase homolog lmo0736 [imported] - Listeria monocytogenes 
(strain EGD-e) 

C; Species: Listeria monocytogenes 

C;Date: 27-Nov-2001 #sequence_revision 27-Nov-2001 #text_change 14-Dec-2001 
C;Accession: AH1166 

R;Glaser, P.; Frangeul, L.; Buchrieser, C; Amend, A.; Baquero, F.; Berche, P.; 
Bloecker, H.; Brandt, P.; Chakraborty, T . ; Charbit, A.; Chetouani, F. ; Couve, 
E.; de Daruvar, A.; Dehoux, P.; Domann, E.; Dominguez-Bernal , G . ; Duchaud, E.; 
Durand, L. ; Dussurget, O.; Entian, K.D.; Fsihi, H . ; Garcia-Del Portillo, F. ; 
Garrido, P.; Gautier, L.; Goebel, W.; Gomez-Lopez, N.; Hain, T.; Hauf, J.; 
Jackson, D.; Jones, L.M.; Karst, U. 
Science 294, 849-852, 2001 

A;Authors: Kreft, J.; Kuhn, M. ; Kunst, F. ; Kurapkat, G.; Madueno , E.; 
Maitournam, A.; Mata Vicente, J.; Ng, E.; Nordsiek, G.; Novella, S.; de Pablos, 
B.; Perez-Diaz, J.C.; Remmel, B.; Rose, M. ; Rusniok, C; Schlueter, T . ; Simoes, 
N . ; Tierrez, A.; Vazquez-Boland, J. A.; Voss, H.; Wehland, J.; Cossart, P. 
A;Title: Comparative genomics of Listeria species. 
A;Reference number: AB1077; MUID : 21537279 ; PMID : 1 167 9 669 
A;Accession: AH1166 
A; Status: preliminary 
A; Molecule type: DNA 
A; Residues: 1-148 <GLA> 

A; Cross -references : GB : NC_0032 1 0 ; PIDN : CAC98 8 14 .1; PID : gl 64 10 12 5 ; GSPDB : GN00177 

A; Experimental source: strain EGD-e 

C ; Genetics : 

A; Gene: lmo0736 

C; Super family : galactoside O-acetylt rans f erase 

Query Match 62.5%; Score 5; DB 2; Length 148; 

Best Local Similarity 100.0%; Pred. No. l.le+02; 

Matches 5; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 



Qy 4 LDAEF 8 

I I I I I 

Db 124 LDAEF 128 



RESULT 24 
AC0407 

ribose-5-phosphate isomerase (EC 5.3.1.6) [imported] - Yersinia pestis (strain 
C092) 

C; Species: Yersinia pestis 



C;Date: 02-Nov-2001 #sequence_revision 02-Nov-2001 #text_change 27-Nov-2001 
C; Access ion: AC04 07 

R;Parkhill, J.; Wren, B.W.; Thomson, N.R.; Titball, R.W.; Holden, M.T.G.; 
Prentice, M.B.; Sebaihia, M. ; James, K.D.; Churcher, C. ; Mungall, K.L.; Baker, 
S.; Basham, D . ; Bentley, S.D.; Brooks, K . ; Cerdeno-Tarraga , A.M.; Chillingworth, 
T.; Cronin, A.; Davies, R.M.; Davis, P.; Dougan, G.; Feltwell, T.; Hamlin, N.; 
Holroyd, S.; Jagels, K. ; Leather, S.; Karlyshev, A.V. ; Moule, S.; Oyston, 
P.C.F.; Quail, M. ; Rutherford, K. ; Simmonds, M. ; Skelton, J.; Stevens, K. ; 
Whitehead, S.; Barrell, B.G. 
Nature 413, 523-527, 2001 

A;Title: Genome sequence of Yersinia pestis, the causative agent of plague. 

A; Reference number: AB0001; MUID : 2 14 7 04 13 ; PMID : 11586360 

A; Access ion: AC04 07 

A; Status: preliminary 

A;Molecule type: DNA 

A; Residues: 1-151 <KUR> 

A;Cross-references : GB : AL590842 ; PIDN : CAC9258 3 . 1 ; PID : gl5981280 ; GSPDB : GN0017 5 
C; Genetics : 
A; Gene: rpiB 

C; Super family : galactoside O-acetyltrans f erase 
C;Keywords: intramolecular oxidoreductase ; isomerase 

Query Match 62.5%; Score 5; DB 2 ; Length 151; 

Best Local Similarity 100.0%; Pred. No. l.le+02; 

Matches 5; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 



Qy 4 LDAEF 8 

I I I I I 

Db 12 6 LDAEF 13 0 



RESULT 2 5 
B70800 

hypothetical protein Rv3753c - Mycobacterium tuberculosis (strain H37RV) 
C; Species: Mycobacterium tuberculosis 

C;Date: 17-Jul-1998 #sequence_revision 17-Jul-1998 #text_change 22-Oct-1999 
C;Accession: B70800 

R;Cole, S.T.; Brosch, R. ; Parkhill, J.; Gamier, T.; Churcher, C; Harris, D.; 
Gordon, S.V.; Eiglmeier, K.; Gas, S.; Barry III, C.E.; Tekaia, F. ; Badcock, K. ; 
Basham, D.; Brown, D.; Chillingworth, T . ; Connor, R. ; Davies, R. ; Devlin, K. ; 
Feltwell, T. ; Gentles, S.; Hamlin, N.; Holroyd, S.; Hornsby, T.; Jagels, K.; 
Krogh, A.; McLean, J.; Moule, S.; Murphy, L.; Oliver, S.; Osborne, J.; Quail, 
M.A. ; Rajandream, M.A. ; Rogers, J.; Rutter, S.; Seeger, K.; Skelton, S.; 
Squares, S. 

Nature 393, 537-544, 1998 

A;Authors: Sqares, R. ; Sulston, J.E.; Taylor, K.; Whitehead, S.; Barrell, B.G. 
A; Title: Deciphering the biology of Mycobacterium tuberculosis from the complete 
genome sequence. 

A; Reference number: A70500; MUID: 98295987 ; PMID: 9634230 
A; Access ion: B708 00 

A; Status: preliminary; nucleic acid sequence not shown; translation not shown 
A;Molecule type: DNA 
A; Residues: 1-166 <COL> 

A;Cross-references: GB:AL022121; GB:AL123456; NID : g32 61559 ; PIDN : CAA18075 . 1 ; 

PID: el2 64 613; PID: g2 9 6017 7 

A; Experimental source: strain H37Rv 

C; Genetics : 



A; Gene : Rv3 753c 



Query Match 62.5%; Score 5; DB 2; Length 166; 

Best Local Similarity 100.0%; Pred. No. 1.2e+02; 

Matches 5; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 

Qy 3 NLDAE 7 

I 1 I I I 

Db 95 NLDAE 99 



RESULT 2 6 
G82898 

conserved hypothetical UU380 [imported] - Ureaplasma urealyticum 
C; Species: Ureaplasma urealyticum 

C;Date: 18-Aug-2000 #sequence_revision 20-Aug-2000 #text_change 20-Aug-2000 
C /Accession : G82 8 98 

R;Glass, J.I.; Lefkowitz, E.J.; Glass, J.S.; Heiner, C.R.; Chen, E.Y.; Cassell, 
G.H. 

submitted to GenBank, February 2000 

A; Description : The complete sequence of Ureaplasma urealyticum: Alternate views 

of a minimal genome and sexually transmitted pathogen. 

A; Reference number : A82 8 7 0 

A; Accession : G82898 

A; Status : preliminary 

A;Molecule type: DNA 

A; Residues : 1-168 <GLA> 

A;Cross-references: GB:AE002135; GB:AF222894; NID : g6899358 ; P I DN : AAF3 07 9 0.1; 
GSPDB: GN0 012 3; UUSP:UU3 8 0 

A; Experimental source: serovar 3; biovar 1 

C; Genetics : 

A; Gene: UU3 8 0 

A; Genetic code : SGC3 

Query Match 62.5%; Score 5; DB 2; Length 168; 

Best Local Similarity 100.0%; Pred. No. 1.3e+02; 

Matches 5; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 

Qy 2 VNLDA 6 

i I i I I 

Db 35 VNLDA 3 9 



RESULT 2 7 
T08348 

hypothetical protein H1517 [imported] - Halobacterium sp. (strain NRC-1) plasmi 
pNRClOO 

C; Species: Halobacterium sp . 
A;Variety: strain NRC-1 

C;Date: ll-Jun-1999 #sequence_revision ll-Jun-1999 #text_change 03-Nov-2000 
C; Access ion : T0834 8 

R;Ng, W.V. ; Ciufo, S.A.; Smith, T.M.; Bumgarner, R.E.; Baskin, D.; Faust, J.; 
Hall, B. ; Loretz, C; Seto, J.; Slagel, J.; Hood, L.; DasSarma, S. 
Genome Res. 8, 1131-1141, 1998 

A;Title: Snapshot of a large dynamic replicon in a halophilic Archaeon: 
megaplasmid or mini chromosome? 

A; Reference number: Z16408; MUID : 9 9063795 ; PMID: 9847077 



A; Access ion: T08 34 8 

A; Status: translated from GB/EMBL/DDBJ 
A; Molecule type: DNA 
A; Residues: 1-17 0 <DAS> 

A; Cross-references : EMBL : AF0164 85 ; NID : g2 8222 7 8 ; PID : g2 822 4 09 ; HALOSP:H1517 

A; Experimental source: strain NRC-1 

C ; Genetics : 

A; Gene: HALOSP :H1517 

A; Genome : plasmid pNRClOO 

Query Match 62.5%; Score 5; DB 2; Length 170; 

Best Local Similarity 100.0%; Pred. No. 1.3e+02; 

Matches 5; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 



Qy 4 LDAEF 8 

I I I I I 

Db 7 5 LDAEF 7 9 



RESULT 2 8 
F65038 

hypothetical protein b2603 - Escherichia coli (strain K-12) 
C; Species: Escherichia coli 

C;Date: 12-Sep-1997 #sequence_revision 17-Sep-1997 #text__change Ol-Mar-2002 
C;Accession: F65038 

R;Blattner, F.R.; Plunkett III, G . ; Bloch, C.A.; Perna, N.T.; Burland, V. ; 
Riley, M. ; Collado-Vides , J.; Glasner, J.D.; Rode, C.K.; Mayhew, G.F.; Gregor, 
J.; Davis, N.W.; Kirkpatrick, H . A. ; Goeden, M.A. ; Rose, D.J.; Mau, B.; Shao, Y. 
Science 277, 1453-1462, 1997 

A; Title: The complete genome sequence of Escherichia coli K-12. 
A; Reference number: A64720; MUID : 97 42 6617 ; PMID: 9278503 
A;Accession: F65038 

A; Status: preliminary; nucleic acid sequence not shown; translation not shown 

A; Molecule type: DNA 

A; Residues: 1-172 <BLAT> 

A;Cross-references : GB:AE000346; GB:U00096; NID : g23 67 14 1 ; PIDN : AAC75652 . 1 ; 
PID: gl7 8 8 955; UWGP:b2 603 

A; Experimental source: strain K-12, substrain MG1655 

C; Superf amily : Escherichia coli hypothetical protein b2603 

Query Match 62.5%; Score 5; DB 2; Length 172; 

Best Local Similarity 100.0%; Pred. No. 1.3e+02; 

Matches 5; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 



Qy 2 VNLDA 6 

I I I I I 

Db 146 VNLDA 150 



RESULT 2 9 
B91062 

hypothetical protein ECs3466 [imported] - Escherichia coli (strain 0157 :H7, 

substrain RIMD 0509952) 

C; Species: Escherichia coli 

C;Date: 18-Jul-2001 #sequence__revision 18-Jul-2001 #text_change 03-Aug-2001 
C;Accession: B91062 



R;Hayashi, T . ; Makino, K. ; Ohnishi, M. ; Kurokawa, K. ; Ishii, K.; Yokoyama, K. ; 
Han, C.G.; Ohtsubo, E . ; Nakayama, K . ; Murata, T.; Tanaka, M. ; Tobe, T.; lida, 
T . ; Takami, H.; Honda, T . ; Sasakawa, C; Ogasawara, N.; Yasunaga, T . ; Kuhara, 
S.; Shiba, T.; Hattori, M. ; Shinagawa, H. 
DNA Res. 8, 11-22, 2001 

A; Title: Complete genome sequence of enterohemor rhagic Escherichia coli Ol57:H7 

and genomic comparison with a laboratory strain K-12. 

A; Reference number: A99629; MUID : 21156231 ; PMID : 1125 87 9 6 

A;Accession: B91062 

A; Status : preliminary 

A;Molecule type: DNA 

A; Residues: 1-172 <HAY> 

A; Cross-references: GB : BA000007 ; PIDN : BAB3 68 8 9.1; PID : gl3362 937 ; GSPDB : GN00154 

A; Experimental source: strain 0157:H7, substrain RIMD 0509952 

C ; Genetics : 

A; Gene: ECs34 66 

C; Superf amily : Escherichia coli hypothetical protein b2603 

Query Match 62.5%; Score 5; DB 2; Length 172; 

Best Local Similarity 100.0%; Pred. No. 1.3e+02; 

Matches 5; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 



Qy 2 VNLDA 6 

I 1 I I I 

Db 14 6 VNLDA 150 



RESULT 30 
F85906 

hypothetical protein Z3897 [imported] - Escherichia coli (strain 0157 :H7, 

substrain EDL933) 

C; Species : Escherichia coli 

C;Date: 16-Feb-2001 #sequence_revis ion 16-Feb-2001 #text_change 14-Sep-2001 
C; Accession: F8 590 6 

R;Perna, N.T.; Plunkett III, G. ; Burland, V.; Mau, B.; Glasner, J.D.; Rose, 
D.J.; Mayhew, G.F.; Evans, P.S.; Gregor, J.; Kirkpatrick, H.A. ; Posfai, G. ; 
Hackett, J.; Klink, S.; Boutin, A.; Shao, Y. ; Miller, L.; Grotbeck, E.J.; Davis, 
N.W.; Lim, A.; Dimalanta, E . ; Potamousis, K. ; Apodaca, J.; Anantharaman , T.S.; 
Lin, J.; Yen, G.; Schwartz, D.C.; Welch, R.A. ; Blattner, F.R. 
Nature 409, 529-533, 2001 

A; Title: Genome sequence of enterohemorrhagic Escherichia coli Ol57:H7. 

A; Reference number: A85480; MUID : 2 1 074 935 ; PMID : 112 0 655 1 

A; Accession: F8 59 06 

A; Status : preliminary 

A;Molecule type: DNA 

A; Residues : 1-172 <ST0> 

A; Cross-references: GB:AE005174; NID : gl2517025 ; PIDN : AAG577 14 . 1 ; GSPDB : GN0 0 145 ; 
UWGP: Z38 97 

A; Experimental source: strain 0157:H7, substrain EDL933 
C ; Genetics : 
A;Gene: Z3897 

C; Superf amily : Escherichia coli hypothetical protein b2603 



Query Match 62.5%; 
Best Local Similarity 100.0%; 
Matches 5; Conservative 



Score 5; DB 2; Length 172; 
Pred. No. 1.3e+02; 
0 ; Mismatches 0 ; Indels 



0; Gaps 



0; 



Qy 2 VNLDA 6 

I I I I I 

Db 146 VNLDA 150 



RESULT 31 
H81251 

NADH2 dehydrogenase (ubiquinone) (EC 1.6.5.3) I chain J Cjl570c [imported] - 
Campylobacter jejuni (strain NCTC 11168) 
C; Species: Campylobacter jejuni 

C;Date: 31-Mar-2000 #sequence__revision 31-Mar-2000 #text_change 03-Jun-2002 
C;Accession: H81251 

R;Parkhill, J.; Wren, B.W.; Mungall, K. ; Ketley, J.M. ; Churcher, C; Basham, D.; 
Chillingworth, T . ; Davies, R.M.; Feltwell, T . ; Holroyd, S.; Jagels, K.; 
Karlyshev, A.; Moule, S.; Fallen, M.J.; Penn, C.W.; Quail, M. ; Rajandream, M.A. ; 
Rutherford, K.M.; VanVliet, A.; Whitehead, S.; Barrell, B.G. 
Nature 403, 665-668, 2000 

A; Title: The genome sequence of the food-borne pathogen Campylobacter jejuni 
reveals hypervariable sequences . 

A;Reference number: A81250; MUID : 2 01509 12 ; PMID : 1068 82 04 
A;Accession: H81251 
A; Status: preliminary 
A;Molecule type: DNA 
A; Residues: 1-172 <PAR> 

A;Cross-references : GB:AL139079; GB:AL111168; NID : g6968 97 1 ; PIDN : CAB73558 . 1 ; 

PID: g6968987; GSPDB : GN0012 0 ; CJSP:Cj 157 0c 

A; Experimental source: serotype 02 , strain NCTC 11168 

C ; Genetics : 

A;Gene: nuoJ; Cjl570c 

C;Keywords: NAD; oxidoreductase 

Query Match 62.5%; Score 5; DB 2; Length 172; 

Best Local Similarity 100.0%; Pred. No. 1.3e+02; 

Matches 5; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 



Qy 4 LDAEF 8 

MM! 

Db 4 8 LDAEF 52 



RESULT 32 
S67579 

probable membrane protein YDL046w - yeast ( Saccharomyces cerevisiae) 
N;Alternate names: hypothetical protein D2699 
C; Species: Saccharomyces cerevisiae 

C;Date: 12-Jul-1996 #sequence_revision 12-Jul-1996 #text_change 19-Apr-2002 
C;Accession: S67579 

R;Paulin, L . ; Saren, A.M.; Laamanen, P. 

submitted to the Protein Sequence Database, July 1996 
A;Reference number: S67560 
A;Accession: S67579 
A;Molecule type: DNA 
A;Residues: 1-173 <PAU> 

A;Cross-references : EMBL:Z74094; NID : gl4 3 1035 ; PID:e253204; PID : gl43l03 6 ; 

GSPDB: GN00004; MIPS:YDL046w 

A; Experimental source: strain S288C 

C;Genetics : 



A; Gene : MIPS : YDL04 6w 

A; Cross -references : SGD: S00022 04 

A;Map position: 4L 

C; Superf amily : Saccharomyces cerevisiae probable membrane protein YDL046w 
C; Keywords: transmembrane protein 

F; 8-24 / Domain : transmembrane #status predicted <TMM> 

Query Match 62.5%; Score 5; DB 2; Length 173; 

Best Local Similarity 100.0%; Pred. No. 1.3e+02; 

Matches 5; Conservative 0; Mismatches 0; Indels 0; Gaps 



Qy 1 EVNLD 5 

Mill 

Db 59 EVNLD 63 



RESULT 33 
E64678 

NADH2 dehydrogenase (ubiquinone) (EC 1.6.5.3) chain NQO10 - Helicobacter pylori 
(strain 26695) 

C; Species: Helicobacter pylori 

C;Date: 09-Aug-1997 #sequence_revision 09-Aug-1997 #text__change 03-Jun-2002 
C;Accession: E64678 

R;Tomb, J.F.; White, O.; Kerlavage, A.R.; Clayton, R.A. ; Sutton, G.G.; 
Fleischmann, R.D.; Ketchum, K.A. ; Klenk, H.P.; Gill, S.; Dougherty, B.A.; 
Nelson, K . ; Quackenbush, J.; Zhou, L.; Kirkness, E.F.; Peterson, S.; Loftus, B. 
Richardson, D.; Dodson, R. ; Khalak, H.G.; Glodek, A.; McKenney, K. ; Fitzegerald 
L.M.; Lee, N.; Adams, M.D.; Hickey, E.K.; Berg, D.E.; Gocayne, J.D.; Utterback, 
T.R.; Peterson, J.D.; Kelley, J.M.; Cotton, M.D.; Weidman, J.M.; Fujii, C; 
Bowman, C; Watthey, L. 
Nature 388, 539-547, 1997 

A;Authors: Wallin, E.; Hayes, W.S.; Borodovsky, M. ; Karpk, P.D.; Smith, H.O.; 
Fraser, CM. ; Venter, J.C. 

A; Title: The complete genome sequence of the gastric pathogen Helicobacter 
pylori . 

A; Reference number: A64520; MUID : 97394467 ; PM1D: 9252185 
A;Accession: E64678 

A; Status: preliminary; nucleic acid sequence not shown; translation not shown 
A;Molecule type: DNA 
A; Residues: 1-182 <TOM> 

A;Cross-references: GB:AE000631; GB:AE000511; NID : g2 314 421; PIDN : AAD08313 . 1 ; 
PID : g2314 4 34; TIGR:HP12 69 

C; Superf amily : NADH dehydrogenase (ubiquinone) chain 6 

C; Keywords: electron transfer; membrane-associated complex; NAD; oxidoreductase 

Query Match 62.5%; Score 5; DB 2; Length 182; 

Best Local Similarity 100.0%; Pred. No. 1.3e+02; 

Matches 5; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 



Qy 4 LDAEF 8 

I I I I I 

Db 4 8 LDAEF 52 



RESULT 34 
C71839 



NADH2 dehydrogenase (ubiquinone) (EC 1.6.5.3) I chain J - Helicobacter pylori 
(strain J99) 

C; Species: Helicobacter pylori 
A;Variety: strain J99 

C;Date: 12-Feb-1999 #sequence_revision 12-Feb-1999 #text_change 03-Jun-2002 
C;Accession: C71839 

R; Aim, R.A. ; Ling, L.S.L.; Moir, D.T.; King, B.L.; Brown, E.D.; Doig, P.C.; 
Smith, D.R.; Noonan, B.; Guild, B.C.; deJonge, B.L.; Carmel, G. ; Tummino, P.J.; 
Caruso, A.; Uria-Nickelsen, M. ; Mills, D.M.; Ives, C; Gibson, R. ; Merberg, D.; 
Mills, S.D.; Jiang, Q . ; Taylor, D.E.; Vovis, G.F.; Trust, T.J. 
Nature 397, 176-180, 1999 

A; Title: Genomic sequence comparison of two unrelated isolates of the human 
gastric pathogen Helicobacter pylori. 

A;Reference number: A71800; MUID : 9912 0557 ; PMID:9923682 
A;Accession: C71839 
A; Status : preliminary 
A; Molecule type: DNA 
A; Residues: 1-182 <ARN> 

A; Cross-references: GB:AE001545; GB:AE001439; NID : g4 15 57 7 6 ; PIDN : AAD06756 . 1 ; 
PID: g4155780 

A; Experimental source: strain J99 
C ; Genetics : 
A; Gene: nuoJ 

C; Superf amily : NADH dehydrogenase (ubiquinone) chain 6 

C; Keywords: membrane-associated complex; NAD; oxidoreductase 

Query Match 62.5%; Score 5; DB 2; Length 182; 

Best Local Similarity 100.0%; Pred. No. 1.3e+02; 

Matches 5; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 

Qy 4 LDAEF 8 

Mill 

Db 4 8 LDAEF 52 



RESULT 35 
AE2330 

adenylate kinase [imported] - Nostoc sp . (strain PCC 7120) 
C;Species: Nostoc sp. PCC 7120 

A;Note: Nostoc sp. strain PCC 7120 is a synonym of Anabaena sp . strain PCC 7120 
C;Date: 14-Dec-2001 #sequence__revision 14-Dec-2001 #text_change 09-Dec~2002 
C; Accession: AE2 33 0 

R;Kaneko, T.; Nakamura, Y . ; Wolk, CP. ; Kuritz, T . ; Sasamoto, S.; Watanabe, A.; 
Iriguchi, M. ; Ishikawa, A.; Kawashima, K.; Kimura, T.; Kishida, Y.; Kohara, M. ; 
Matsumoto, M. ; Matsuno, A.; Muraki, A.; Nakazaki, N . ; Shimpo, S.; Sugimoto, M. ; 
Takazawa, M. ; Yamada, M. ; Yasuda, M. ; Tabata, S. 
DNA Res. 8, 205-213, 2001 

A; Title: Complete Genomic Sequence of the Filamentous Nitrogen-fixing 

Cyanobacterium Anabaena sp . strain PCC 7120. 

A; Reference number: AB1807; MUID : 2 15952 8 5 ; PMID : 11759840 

A;Accession: AE2330 

A; Status : preliminary 

A; Molecule type: DNA 

A; Residues: 1-184 <KUR> 

A/Cross-references: GB:BA000019; PIDN : BAB7 58 95 . 1 ; PID : gl7 13333 1 ; GSPDB : GN0 0 17 9 
A; Experimental source: strain PCC 7120 
C ; Genetics : 



A;Gene: all4196 

C; Super family : adenylate kinase 



Query Match 62.5%; Score 5; DB 2; Length 184; 

Best Local Similarity 100.0%; Pred. No. 1.4e+02; 

Matches 5; Conservative 0; Mismatches 0; Indels 0; Gaps 

Qy 2 VNLDA 6 

I I I I I 

Db 112 VNLDA 116 



RESULT 36 
S61047 

hypothetical protein YDLl66c - yeast (Saccharomyces cerevisiae) 
N;Alternate names: hypothetical protein D1487 
C; Species: Saccharomyces cerevisiae 

C;Date: 15-Feb-1996 #sequence_revision 01-Mar-1996 #text_change 19-Apr-2002 
C;Accession: S61047; S67718 
R;Pohl, T.M. 

submitted to the EMBL Data Library, November 1995 
A;Reference number: S61010 
A; Accession : S 61047 
A;Molecule type: DNA 
A; Residues: 1-197 <POH> 

A;Cross-references : EMBL:Z67750; NID : gl 0 6125 6 ; PIDN : CAA915 8 0 . 1 ; PID:gl061273 
R; Pohl, T.M. 

submitted to the Protein Sequence Database, July 1996 
A;Reference number: S67708 
A;Accession: S67718 
A;Molecule type : DNA 
A; Residues: 1-197 <P0W> 

A;Cross-references : EMBL:Z74214; NID : gl4 312 63 ; PIDN : CAA9 8 74 0 . 1 ; PID:gl431264 
MIPS : YDLl66c 

A; Experimental source: strain S28 8C 
C ; Genetics : 

A;Cross-references : SGD: S0002325 
A;Map position: 4L 

C;Superfamily: conserved hypothetical protein MJ1050 

Query Match 62.5%; Score 5; DB 2; Length 197; 

Best Local Similarity 100.0%; Pred. No. 1.4e+02; 

Matches 5; Conservative 0; Mismatches 0; Indels 0; Gaps 



Qy 3 NLDAE 7 

[ I I I I 

Db 12 4 NLDAE 12 8 



RESULT 37 
E64102 

skp protein - Haemophilus influenzae (strain Rd KW20) 
C; Species: Haemophilus influenzae 

C;Date: 18-Aug-1995 #sequence__revision 18-Aug-1995 #text_change 08-Oct-1999 
C; Accession : E64 102 

R;Fleischmann, R.D.; Adams, M.D.; White, O.; Clayton, R.A.; Kirkness, E.F.; 
Kerlavage, A.R.; Bult, C.J.; Tomb, J.F.; Dougherty, B.A.; Merrick, J.M.; 



McKenney, K . ; Sutton, G. ; FitzHugh, W. ; Fields, C; Gocayne, J.D.; Scott, J.; 
Shirley, R. ; Liu, L.I.; Glodek, A.; Kelley, J.M. ; Weidman, J.F.; Phillips, C.A.; 
Spriggs, T.; Hedblom, E . ; Cotton, M.D.; Utterback, T.R.; Hanna, M.C.; Nguyen, 
D.T.; Saudek, D.M.; Brandon, R.C.; Fine, L.D.; Fritchman, J.L.; Fuhrmann, J.L.; 
Geoghagen, N . S .M. 
Science 269, 496-512, 1995 

A;Authors: Gnehm, C.L.; McDonald, L.A. ; Small, K.V.; Fraser, CM.; Smith, H.O.; 
Venter, J.C. 

A; Title: Whole-genome random sequencing and assembly of Haemophilus influenzae 
Rd. 

A;Reference number: A64000; MUID : 95350 63 0 ; PMID:7542800 
A/Accession: E64102 

A; Status : nucleic acid sequence not shown; translation not shown 

A; Molecule type: DNA 

A; Residues: 1-197 <TIGR> 

A; Cross-references : GB: U3277 3; GB : L42 023; NID : gl57 3 932 ; PIDN : AAC22 57 4 . 1 ; 
PID: gl57 3 937 ; TIGR: HI 0 916 
C ; Genetics : 
A; Gene : skp 

Query Match 62.5%; Score 5; DB 2 ; Length 197; 

Best Local Similarity 100.0%; Pred. No. 1.4e+02; 

Matches 5; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 

Qy 4 LDAEF 8 

Mill 

Db 4 9 LDAEF 53 



RESULT 3 8 
AG0986 

hypothetical luxR-f amily transcription regulator STY4197 [imported] - Salmonella 

enterica subsp. enterica serovar Typhi (strain CT18) 

C; Species: Salmonella enterica subsp. enterica serovar Typhi 

A; Note: this species has also been called Salmonella typhi 

C;Date: 09-Nov-2001 #sequence_revision 09-Nov-2001 #text_change 15-Sep-2003 
C; Accession: AG098 6 

R;Parkhill, J.; Dougan, G.; James, K.D.; Thomson, N.R.; Pickard, D.; Wain, J.; 
Churcher, C; Mungall, K.L.; Bentley, S.D.; Holden, M.T.G.; Sebaihia, M . ; Baker, 
S.; Basham, D.; Brooks, K.; Chillingworth, T.; Connerton, P.; Cronin, A.; Davis, 
P.; Davies, R.M.; Dowd, L.; White, N.; Farrar, J.; Feltwell, T.; Hamlin, N.; 
Haque, A.; Hien, T.T.; Holroyd, S.; Jagels, K . ; Krogh, A.; Larsen, T.S.; 
Leather, S.; Moule, S.; O'Gaora, P. 
Nature 413, 848-852, 2001 

A; Authors: Parry, C; Quail, M. ; Rutherford, K.; Simmonds, M. ; Skelton, J.; 
Stevens, K . ; Whitehead, S.; Barrell, B.G. 

A; Title: Complete genome sequence of a multiple drug resistant Salmonella 
enterica serovar Typhi CT18. 

A; Reference number: AB0502; MUID : 2 1534 94 7 ; PMID : 11677608 
A;Accession: AG0986 
A; Status: preliminary 
A; Molecule type: DNA 
A; Residues: 1-200 <PAR> 

A; Cross- references : GB:AL5133 82; PIDN : CAD08 019 . 1; PID : gl 65050 02 ; GSPDB : GN0 0 17 6 

C ; Geneti cs : 

A; Gene: STY4197 



C; Super family : response regulator with HTH DNA-binding domain, NarL type; 
response regulator homology 



Query Match 62.5%; Score 5; DB 2; Length 200; 

Best Local Similarity 100.0%; Pred. No. 1.5e+02; 

Matches 5; Conservative 0; Mismatches 0; Indels 0; Gaps 0 

Qy 4 LDAEF 8 

i II i I 

Db 56 LDAEF 60 



RESULT 3 9 
A53646 

rho GDP dissociation inhibitor - yeast ( Saccharomyces cerevisiae) 
N;Alternate names: protein D2175; protein YDLl35c 
C; Species: Saccharomyces cerevisiae 

C;Date: 07-Oct-1994 #sequence_revision 07-Oct-1994 #text_change 18-Aug-2000 
C;Accession: A53646; B53646; S67681 

R;Masuda, T.; Tanaka, K.; Nonaka, H.; Yamochi, W. ; Maeda, A.; Takai, Y. 
J. Biol. Chem. 269, 19713-19718, 1994 

A;Title: Molecular cloning and characterization of yeast rho GDP dissociation 
inhibitor . 

A;Reference number: A53646; MUID : 94327 51 1 ; PMID: 8051050 
A;Accession: A53646 
A;Molecule type: DNA 
A; Residues: 1-202 <MAS> 

A;Cross-references : GB:D31630; NID:g516856; PIDN : BAA064 9 9 . 1 ; PID:g516857 
A;Accession: B53646 
A;Molecule type: protein 

A; Residues : 2 7-42 ; 66-7 0 ; 1 1 6- 131 ; 14 0-152 ; 15 5-163 <MA2> 
R;Saluz, H.P.; Woelfl, S.; Hanemann, V. 

submitted to the Protein Sequence Database, July 1996 
A;Reference number: S67677 
A;Accession: S67681 
A;Molecule type: DNA 
A;Residues: 1-202 <SAL> 

A;Cross-references : EMBL:Z74183; NID : gl4 312 06 ; PID : gl431207 ; MIPS:YDLl35c 

A; Experimental source: strain S288C 

C ; Genetics : 

A; Gene: SGD:RDI1 

A;Cross-references : SGD : S 0 0022 94 ; MIPS:YDLl35c 
A;Map position: 4L 

C; Superf amily : human rho GDP dissociation inhibitor 

Query Match 62.5%; Score 5; DB 2; Length 2 02; 

Best Local Similarity 100.0%; Pred. No. 1.5e+02; 

Matches 5; Conservative 0; Mismatches 0; Indels 0; Gaps 

Qy 3 NLDAE 7 

I I I M 

Db 33 NLDAE 37 



RESULT 4 0 

S78221 

G-protein 



signaling regulator RGP4 - human 



N; Alternate names: G-protein signaling regulator RGS4 
C; Species: Homo sapiens (man) 

C;Date: 29-Jan-1998 #sequence_revision 13-Feb-1998 #text_change 31-Jan-2000 
C;Accession: S78221; S68509 
R;Druey, K. 

submitted to the EMBL Data Library, May 1995 
A;Reference number: S78089 
A; Accession : S7 8221 
A; Molecule type: mRNA 
A; Residues: 1-205 <DRU> 

A;Cross-references: EMBL:U27768; NID : gl2 16372 ; PIDN : AAC50395 . 1 ; PID:gl216373 
R;Druey, K.M. ; Blumer, K.J.; Kang, V.H.; Kehrl, J.H. 
Nature 379, 742-746, 1996 

A;Title: Inhibition of G-protein-mediated MAP kinase activation by a new 
mammalian gene family. 

A; Reference number: A58012; MUID : 9617 84 95 ; PMID: 8602223 
A/Accession: S68509 

A; Status: preliminary; nucleic acid sequence not shown 
A;Molecule type: mRNA 

A; Residues: 1-35, 'S', 37-39, 'S', 41-46, 'T ■ , 48-67, 1 N 1 , 69-7 6, 'L', 78-177, 'T',179- 

197, 'T 1 , 199-205 <DRW> 

A; Cross-ref erences : EMBL:U27768 

A; Experimental source: brain 

C; Super family : B-cell activation protein BL34 

Query Match 62.5%; Score 5; DB 2; Length 205; 

Best Local Similarity 100.0%; Pred. No. 1.5e+02; 

Matches 5; Conservative 0; Mismatches 0; Indels 0; Gaps 



Qy 1 EVNLD 5 

I I I I I 

Db 12 6 EVNLD 130 



Search completed: March 26, 2004, 15:35:56 
Job time : 2 7.5 sees 



RC STRAIN=LT2 / SGSC1412 / ATCC 700720; 

RX MEDLINE— 2 1534948; PubMed=ll 677 609 ; 

RA McClelland M., Sanderson K.E., Spieth J., Clifton S.W., Latreille P., 

RA Courtney L., Porwollik S., Ali J., Dante M. , Du F. , Hou S., Layman D., 

RA Leonard S., Nguyen C, Scott K. , Holmes A., Grewal N . , Mulvaney E., 

RA Ryan E . , Sun H. f Florea L., Miller W. , Stoneking T., Nhan M. , 

RA Waterston R. , Wilson R.K.; 

RT "Complete genome sequence of Salmonella enterica serovar Typhimurium 

RT LT2 . " ; 

RL Nature 413:852-856(2001). 

CC -!- FUNCTION: REDUCTION OF A VARIETY OF N I T ROAROMAT I C COMPOUNDS USING 

CC NADH (AND TO LESSER EXTENT NADPH) AS SOURCE OF REDUCING 

CC EQUIVALENTS; TWO ELECTRONS ARE TRANSFERRED. CAPABLE OF REDUCING 

CC NITROFURAZONE (BY SIMILARITY) . 

CC -!- COFACTOR: FMN . 

CC -!- SUBUNIT: Monomer (By similarity). 

CC -!- SIMILARITY: Belongs to the nitroreductase family. 

cc 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinf ormatics and the EMBL outstation - 

CC the European Bioinf ormatics Institute. There are no restrictions on its 

CC use by non-profit institutions as long as its content is in no way 

CC modified and this statement is not removed. Usage by and for commercial 

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/ 

CC or send an email to license@isb-sib . ch) . 

cc 

DR EMBL; X17250; CAA35113.1; -. 

DR EMBL; AE008722; AAL19529.1; -. 

DR PIR; S08397; S08397 . 

DR HSSP; P38489; 1DS7 . 

DR StyGene; SG10246; nfnB. 

DR InterPro; IPR000415; Nitroreductase. 

DR Pfam; PF00881; Nitroreductase; 1. 

KW Oxidoreductase ; NAD; NADP; Flavoprotein ; FMN; Complete proteome. 

FT NP_BIND 153 158 NAD OR NADP (BY SIMILARITY) . 

SQ SEQUENCE 217 AA; 23955 MW; E9 0E9E0 5A0 8 2 6D0F CRC64; 

Query Match 62.5%; Score 5; DB 1; Length 217; 

Best Local Similarity 100.0%; Pred. No. 83; 

Matches 5; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 

Qy 4 LDAEF 8 

I I I I I 

Db 172 LDAEF 17 6 



RESULT 2 9 
MINC PASMU 



ID 
AC 
DT 
DT 
DT 
DE 
GN 
OS 
OC 



MINC_PASMU 
P57845; 
16-OCT-2001 
16-OCT-2001 
28-FEB-2003 



STANDARD; 



PRT; 



225 AA. 



(Rel. 40, Created) 
(Rel. 40, Last sequence update) 
(Rel. 41, Last annotation update) 
Probable septum site-determining protein minC. 
MINC OR PM0443. 
Pasteurella multocida. 

Bacteria; Proteobacteria ; Gammaproteobacteria; 



Pas teurellales ; 



OC Pas teurellaceae ; Pasteurella. 

OX NCBI_TaxID=747; 

RN [1] 

RP SEQUENCE FROM N.A. 

RC STRAIN=Pm7 0; 

RX MEDLINE— 2 114 5 8 66; PubMed=l 1 2 4 8100; 

RA May B.J., Zhang Q . , Li L.L., Paustian M.L., Whittam T.S., Kapur V. ; 

RT "Complete genomic sequence of Pasteurella multocida Pm70. M ; 

RL Proc. Natl. Acad. Sci . U.S.A. 98:3460-3465(2001). 

CC FUNCTION: Cell division inhibitor that blocks the formation of 

CC polar Z ring septums . Rapidly oscillates between the poles of the 

CC cell to destabilize ftsZ filaments that have formed before they 

CC mature into polar Z rings. Prevent ftsZ polymerization (By 

CC similarity) . 

CC SUBUNIT: Interacts with minD and ftsZ (By similarity). 

CC -!- SIMILARITY: Belongs to the minC family. 

CC 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinf ormatics and the EMBL outstation - 

CC the European Bioinf ormatics Institute. There are no restrictions on its 

CC use by non-profit institutions as long as its content is in no way 

CC modified and this statement is not removed. Usage by and for commercial 

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/ 

CC or send an email to license@isb-sib . ch) . 

CC 

DR EMBL; AE006080; AAK02527.1; -. 

DR HAMAP; MF_002 67; -; 1. 

DR InterPro; IPR005526; MinC. 

DR InterPro; IPR007874; MinC_N. 

DR Pfam; PF03775; MinC_C; 1. 

DR Pfam; PF05209; MinC_N; 1. 

DR TIGRFAMs; TIGR01222; minC; 1. 

KW Cell division; Septation; Complete proteome. 

SQ SEQUENCE 225 AA; 25210 MW; B64BB7 7F8 02AA58E CRC64; 

Query Match 62.5%; Score 5; DB 1; Length 225; 

Best Local Similarity 100.0%; Pred. No. 86; 

Matches 5; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 

Qy 4 LDAEF 8 

I I I I I 

Db 180 LDAEF 18 4 



RESULT 30 
Y127 TREPA 



ID 
AC 
DT 
DT 
DT 
DE 
GN 
OS 
OC 

ox 

RN 



Y12 7__TREPA 
083164; 
30-MAY-2000 
30-MAY-2000 
16-OCT-2001 



STANDARD; 



PRT; 



229 AA. 



(Rel. 39, Created) 
(Rel. 39, Last sequence update) 
(Rel. 40, Last annotation update) 
Hypothetical protein TP0127. 
TP0127 . 

Treponema pallidum. 

Bacteria; Spi rochaetes ; Spirochaetales ; Spirochaetaceae ; Treponema 
NCBI_TaxID=160; 

[1] 



RP SEQUENCE FROM N.A. 

RC STRAlN=Nichols ; 

RX MEDLINE=98332770; PubMed=9 665 8 7 6 ; 

RA Fraser CM., Norris S.J., Weinstock G.M., White O., Sutton G.G., 

RA Dodson R. , Gwinn M. , Hickey E.K., Clayton R. , Ketchum K.A. , 

RA Sodergren E., Hardham J.M., McLeod M.P., Salzberg S., Peterson J., 

RA Khalak H . , Richardson D., Howell J . K . , Chidambaram M. , Utterback T., 

RA McDonald L . , Artiach P . , Bowman C, Cotton M.D., Fujii C, Garland S., 

RA Hatch B., Horst K. , Roberts K . , Sandusky M. , Weidman J., Smith H.O., 

RA Venter J. C. ; 

RT "Complete genome sequence of Treponema pallidum, the syphilis 

RT spirochete."; 

RL Science 2 81:375-38 8(1998). 

CC -!- SIMILARITY: TO T. PALLIDUM TP0315, TP0618 AND TP0619. 

CC 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinf ormatics and the EMBL outstation - 

CC the European Bioinf ormatics Institute. There are no restrictions on its 

CC use by non-profit institutions as long as its content is in no way 

CC modified and this statement is not removed. Usage by and for commercial 

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/ 

CC or send an email to license@isb-sib . ch) . 

CC 

DR EMBL; AE001198; AAC65119.1; -. 

DR PIR; D713 62; D713 62 . 

DR TIGR; TP0127; -. 

KW Hypothetical protein; Complete proteome. 

SQ SEQUENCE 229 AA; 24870 MW; 39C216395BC6FE88 CRC64 ; 

Query Match 62.5%; Score 5; DB 1; Length 22 9; 

Best Local Similarity 100.0%; Pred. No. 87; 

Matches 5; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 

Qy 2 VNLDA 6 

MM! 

Db 8 9 VNLDA 93 



RESULT 31 
CTRA_CAUCR 

ID CTRA_CAUCR STANDARD; PRT; 231 AA. 

AC Q45994; 

DT 16-OCT-2001 (Rel. 40, Created) 

DT 16-OCT-2001 (Rel. 40, Last sequence update) 

DT 10-OCT-2003 (Rel. 42, Last annotation update) 

DE Cell cycle transcriptional regulator ctrA (Response regulator sokA) 
GN CTRA OR SOKA OR CC3035. 
OS Caulobacter crescentus . 

OC Bacteria; Proteobacteria ; Alphaproteobacteria ; Caulobacterales ; 
OC Caulobacteraceae; Caulobacter. 
OX NCBI_TaxID=1558 92; 
RN [1] 

RP SEQUENCE FROM N.A. 

RC STRAIN-CB15N / NA1000; 

RX MEDLINE=9 614 0642; PubMed=8 54 8 82 9 ; 

RA Quon K.C., Marczynski G.T., Shapiro L . ; 

"Cell cycle control by an essential bacterial two-component signal 



RT 



RT transduction protein."; 

RL Cell 84:83-93(1996). 

RN [2] 

RP SEQUENCE FROM N.A. 

RC STRAIN=ATCC 19089 / CB15; 

RX MEDLINE=21173698 ; PubMed-1 12 59 64 7 ; 

RA Nierman W.C., Feldblyum T.V., Laub M.T., Paulsen I.T., Nelson K.E., 

RA Eisen J. A. , Heidelberg J.F., Alley M.R.K., Ohta N., Maddock J.R., 

RA Potocka I., Nelson W.C., Newton A. , Stephens C, Phadke N . D . , Ely B., 

RA DeBoy R.T., Dodson R.J., Durkin A.S., Gwinn M.L., Haft D.H., 

RA Kolonay J.F., Smit J., Craven M.B., Khouri H., Shetty J., Berry K. , 

RA Utterback T., Tran K., Wolf A., Vamathevan J., Ermolaeva M. , White O., 

RA Salzberg S.L., Venter J.C., Shapiro L . , Fraser CM.; 

RT "Complete genome sequence of Caulobacter cres centus . " ; 

RL Proc. Natl. Acad. Sci. U.S.A. 98:4136-4141(2001). 

RN [3] 

RP SEQUENCE OF 159-231 FROM N.A. 

RC STRAIN=ATCC 1908 9 / CB15; 

RX MEDLINE=98132 609; PubMed=94 65034 ; 

RA Wu J., Ohta N., Newton A.; 

RT "An essential, multicomponent signal transduction pathway required for 

RT cell cycle regulation in Caulobacter."; 

RL Proc. Natl. Acad. Sci. U.S.A. 95:1443-144 8(1998). 

CC -!- FUNCTION: FORMS PART OF A TWO-COMPONENT REGULATORY SYSTEM 

CC CTRA/CCKA THAT CONTROLS MULTIPLE EVENTS IN THE CELL CYCLE, 

CC INCLUDING CELL DIVISION, STALK SYNTHESIS, AND CELL CYCLE- S PECI FI C 

CC TRANSCRIPTION. BINDS TO A GROUP OF CELL CYCLE- REGULATED PROMOTERS 

CC CRITICAL FOR DNA REPLICATION, DNA METHYLATION, AND CLASS II 

CC FLAGELLAR BIOGENESIS. 

CC -!- PTM: PHOSPHORYLATED BY CCKA. 

CC — !- SIMILARITY: Contains 1 response regulatory domain. 

CC 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinf ormatics and the EMBL outstation - 

CC the European Bioinf ormatics Institute. There are no restrictions on its 

CC use by non-profit institutions as long as its content is in no way 

CC modified and this statement is not removed. Usage by and for commercial 

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/ 

CC or send an email to license@isb-sib . ch) . 

CC 

DR EMBL; U39559; AAA93080.1; -. 

DR EMBL; AE005966; AAK24997.1; 

DR EMBL; AF021339; AAC05479.1; -. 

DR PIR; A87625; A87625. 

DR HSSP; P08402; 1B00. 

DR TIGR; CC3035; -. 

DR InterPro; IPR001789; Response_reg . 

DR InterPro; IPR001867; Trans_reg_C. 

DR Pfam; PF00072; response_reg ; 1. 

DR Pfam; PF00486; trans_reg_C ; 1. 

DR ProDom; PD000039; Response__reg ; 1. 

DR ProDom; PD000329; Trans_reg_C; 1. 

DR SMART; SM00448; REC; 1. 

DR PROSITE; PS50110; RES PONS E_REGULATORY; 1. 

KW Sensory transduction; Phosphorylation; Transcription regulation; 

KW DNA-binding; Complete proteome . 

FT DOMAIN 1 116 RESPONSE REGULATORY . 



FT MOD_RES 51 51 PHOSPHORYLATION (BY SIMILARITY) . 

SQ SEQUENCE 231 AA; 25796 MW; BB8 12AF6EDAADBF8 CRC64; 



Query Match 62.5%; Score 5; DB 1; Length 231; 

Best Local Similarity 100.0%; Pred. No. 88; 

Matches 5; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 

Qy 2 VNLDA 6 

INI! 

Db 134 VNLDA 138 



RESULT 32 
RL1 HELP J 



ID RLl_HELP J STANDARD ; PRT; 234 AA. 

AC Q9ZK21; 

DT 30-MAY-2000 (Rel. 39, Created) 

DT 30-MAY-2000 (Rel. 39, Last sequence update) 

DT 10-OCT-2003 (Rel. 42, Last annotation update) 

DE 50S ribosomal protein Ll . 

GN RPLA OR JHP1124. 

OS Helicobacter pylori J99 (Campylobacter pylori J99) . 

OC Bacteria; Proteobacteria ; Epsilonproteobacteria ; Campylobacterales ; 

OC Helicobacteraceae ; Helicobacter. 

OX NCBI_TaxID=85 963; 

RN [1] 

RP SEQUENCE FROM N.A. 

RX MEDLINE=99120557; PubMed=9923682 ; 

RA Aim R.A., Ling L.-S.L., Moir D.T., King B.L., Brown E.D., Doig P.C., 

RA Smith D.R., Noonan B., Guild B.C., deJonge B.L., Carmel G., 

RA Tummino P. J., Caruso A., Uria-Nickelsen M., Mills D.M., Ives C, 

RA Gibson R. , Merberg D., Mills S.D., Jiang Q . , Taylor D.E., Vovis G.F., 

RA Trust T . J. ; 

RT "Genomic sequence comparison of two unrelated isolates of the human 

RT gastric pathogen Helicobacter pylori."; 

RL Nature 397:17 6-18 0(1999). 

CC -!- FUNCTION: This protein binds directly to 23S rRNA and is located 
CC in the neighborhood of the site where elongation factor Tu is 

CC bound to the ribosome (By similarity) . 

CC -!- FUNCTION: Peptides originating from the N-terminal end of Ll have 

CC antibacterial activity against bacteria such as E.coli and 

CC B .megaterium. Has no effect on H. pylori itself. These peptides 

CC may be released in the stomach during altrusitic lysis so as to 

CC kill other fast growing bacteria (By similarity) . 

CC -!- SUBUNIT: Part of the SOS ribosomal subunit. 

CC -!- SIMILARITY: Belongs to the LIP family of ribosomal proteins. 

CC 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinf ormatics and the EMBL outstation - 

CC the European Bioinf ormatics Institute. There are no restrictions on its 

CC use by non-profit institutions as long as its content is in no way 

CC modified and this statement is not removed. Usage by and for commercial 

CC entities requires a license agreement (See http://www.isb~sib.ch/announce/ 

CC or send an email to license@isb-sib . ch ) . 

cc 

DR EMBL; AE001540; AAD06702.1; 

DR PIR; E71846; E71846. 



DR HSSP; P27150; 1AD2 - 

DR HAMAP; MF_01318; -; 1. 

DR InterPro; IPR002143; Ribosomal_Ll . 

DR InterPro; IPR005878; Ribosomal__Llb/ c . 

DR Pfam; PF00687; Ribosomal__Ll ; 1. 

DR ProDom; PD001314; Ribosomal_Ll ; 1. 

DR TIGRFAMs; TIGR01169; rplA_bact; 1. 

DR PROSITE; PS01199; RIBOSOMAL_Ll ; 1. 

KW Ribosomal protein; RNA-binding; rRNA-binding ; Antibiotic; 

KW Complete proteorae. 

SQ SEQUENCE 234 AA; 25361 MW; 14C7 7CFC14 8 3A2A4 CRC64; 

Query Match 62.5%; Score 5; DB 1; Length 234; 

Best Local Similarity 100.0%; Pred. No. 89; 

Matches 5; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 

Qy 2 VNLDA 6 

I I I I I 

Db 223 VNLDA 227 



RESULT 33 
RGS I_HUMAN 

ID RG S I _HUMAN STANDARD; PRT; 235 AA. 

AC Q9NS2 8; 

DT 16-OCT-2001 (Rel. 40, Created) 

DT 16-OCT-2001 (Rel. 40, Last sequence update) 

DT 10-OCT-2003 (Rel. 42, Last annotation update) 

DE Regulator of G-protein signaling 18 ( RGS 18). 

GN RGS 1 8 OR RGS13. 

OS Homo sapiens (Human) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleos tomi ; 

OC Mammalia; Eutheria; Primates; Catarrhini ; Hominidae; Homo. 

OX NCBI_TaxID=9 60 6; 

RN [1] 

RP SEQUENCE FROM N . A. 

RA Zhang W . , Wan T., Yuan Z., He L . , Cao X.; 

RT "A novel regulator of G-protein signaling."; 

RL Submitted (JUL-1998) to the EMBL/ GenBank/DDBJ databases. 

RN [2] 

RP SEQUENCE FROM N . A. 

RC TISSUE=Platelet ; 

RA Gagnon A.W., Murray D.L., Leadley R.J. Jr.; 

RT "Cloning and characterization of a novel regulator of G-protein 

RT signaling in human platelets."; 

RL Submitted (MAY-2000) to the EMBL/ GenBank/DDBJ databases. 

RN [3] 

RP SEQUENCE FROM N . A. 

RX PubMed=11042171; 

RA Park I.K., Klug C.A., Li K. , Jerabek L . , Li L., Nanamori M. , 

RA Neubig R.R., Hood L . , Weissman I.L., Clarke M.F.; 

RT "Molecular cloning and characterization of a novel regulator of 

RT G-protein signaling from mouse hematopoietic stem cells."; 

RL J. Biol. Chem. 2 7 6:915-923(2 001). 

RN [4] 

RP SEQUENCE FROM N . A. 

RC TISSUE=Liver; 



RX MEDLINE=22388257; PubMed=124 77 932 ; 

RA Strausberg R.L., Feingold E.A., Grouse L.H., Derge J.G., 

RA Klausner R.D., Collins F.S., Wagner L., Shenmen CM., Schuler G.D., 

RA Altschul S.F., Zeeberg B., Buetow K.H., Schaefer C.F., Bhat N.K., 

RA Hopkins R.F., Jordan H., Moore T., Max S . I . , Wang J., Hsieh F . , 

RA Diatchenko L., Marusina K., Farmer A. A. , Rubin G.M., Hong L., 

RA Stapleton M. , Soares M.B., Bonaldo M.F., Casavant T . L . , Scheetz T.E., 

RA Brownstein M.J., Usdin T . B . , Toshiyuki S . , Carninci P., Prange C, 

RA Raha S.S., Loquellano N.A., Peters G.J., Abramson R.D., Mullahy S.J., 

RA Bosak S.A., McEwan P.J., McKernan K.J., Malek J. A. , Gunaratne P.H., 

RA Richards S., Worley K.C., Hale S., Garcia A.M., Gay L.J., Hulyk S.W., 

RA Villalon D.K., Muzny D.M., Sodergren E.J., Lu X., Gibbs R.A. , 

RA Fahey J., Helton E., Ketteman M. , Madan A., Rodrigues S., Sanchez A., 

RA Whiting M. , Madan A., Young A.C., Shevchenko Y., Bouffard G.G., 

RA Blakesley R.W., Touchman J.W., Green E.D., Dickson M.C., 

RA Rodriguez A.C., Grimwood J., Schmutz J., Myers R.M. , 

RA Butterfield Y.S.N., Krzywinski M.I. f Skalska U., Smailus D.E., 

RA Schnerch A., Schein J.E., Jones S.J.M., Marra M.A. ; 

RT "Generation and initial analysis of more than 15,000 full-length 

RT human and mouse cDNA sequences."; 

RL Proc. Natl. Acad. Sci. U.S.A. 99:16899-16903(2002). 

CC FUNCTION: INHIBITS SIGNAL TRANSDUCTION BY INCREASING THE GTPASE 

CC ACTIVITY OF G PROTEIN ALPHA SUBUNITS THEREBY DRIVING THEM INTO 

CC THEIR INACTIVE GDP-BOUND FORM. BINDS TO G ( I ) -ALPHA AND G ( Q ) -ALPHA 

CC (BY SIMILARITY) . 

CC -!- SUBCELLULAR LOCATION: Cytoplasmic (By similarity). 

CC -!- TISSUE SPECIFICITY: Expressed in peripheral leukocytes, bone 

CC marrow, spleen and fetal liver. 

CC -!- SIMILARITY: Contains 1 RGS domain. 

CC 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinf ormatics and the EMBL outstation - 

CC the European Bioinf ormatics Institute. There are no restrictions on its 

CC use by non-profit institutions as long as its content is in no way 

CC modified and this statement is not removed. Usage by and for commercial 

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/ 

CC or send an email to license@isb-sib . ch ) . 

CC 

DR EMBL; AF076642; AAF80227.1; -. 

DR EMBL; AF268036; AAK58589.1; -. 

DR EMBL; BC020632; AAH20632.1; 

DR HSSP; P49799; 1AGR. 

DR Genew; HGNC: 14261; RGS 18 . 

DR MIM; 607192; -. 

DR InterPro; IPR000342; Regl_Gprotein . 

DR Pfam; PF00615; RGS; 1. 

DR PRINTS; PR01301; RGS PROTEIN . 

DR ProDom; PD001580; Regl_Gprotein ; 1. 

DR SMART; SM00315; RGS; 1. 

DR PROSITE; PS50132; RGS; 1. 

KW Signal transduction inhibitor. 

FT DOMAIN 86 202 RGS. 

FT CONFLICT 22 6 227 DV -> ML (IN REF. 3) . 

SQ SEQUENCE 235 AA; 27582 MW; 97 3ABDE8EC7DE3D5 CRC64; 



Query Match 62.5%; Score 5; DB 1; Length 235; 

Best Local Similarity 100.0%; Pred. No. 89; 



Matches 5; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 



Qy 1 EVNLD 5 

Mill 

Db 150 EVNLD 154 

RESULT 34 
NPD_RHOBA 

ID NPD_RHOBA STANDARD; PRT ; 239 AA. 

AC Q7UFQ9; 

DT 15-MAR-2004 (Rel. 43, Created) 

DT 15-MAR-2004 (Rel. 43, Last sequence update) 

DT 15-MAR-2004 (Rel. 43, Last annotation update) 

DE NAD-dependent deacetylase (EC 3.5.1.-) (Regulatory protein SIR2 

DE homolog) . 

GN NPDA OR RB8404 . 

OS Rhodopirellula baltica. 

OC Bacteria; Planctomycetes ; Planctomycetacia ; Planctomycetales ; 

OC Planctomycetaceae ; Pirellula. 

OX NCBI_TaxID=117 ; 

RN [1] 

RP SEQUENCE FROM N.A. 

RC STRATN=1 ; 

RX MEDLINE=22735 913; PubMed=12 8 3 5416; 

RA Gloeckner F.O., Kube M. , Bauer M. , Teeling H., Lombardot T., 

RA Ludwig W. , Gade D., Beck A., Borzym K., Heitmann K., Rabus R. , 

RA Schlesner H., Mann R. , Reinhardt R. ; 

RT "Complete genome sequence of the marine planctomycete Pirellula sp . 

RT strain 1 . " ; 

RL Proc. Natl. Acad. Sci. U.S.A. 100:8298-8303(2003). 

CC -!- FUNCTION: Modulates the activities of several enzymes which are 
CC inactive in their acetylated form (By similarity) . 

CC -!- CATALYTIC ACTIVITY: NAD ( + ) + an acetylprotein = nicotinamide + O- 
CC acetyl -ADP-ribose + a protein. 

CC -!- COFACTOR: NAD (By similarity). 

CC -!- SUBCELLULAR LOCATION: Cytoplasmic (Probable). 

CC -f- SIMILARITY: Belongs to the sirtuin family. 

CC 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinf ormatics and the EMBL outstation - 

CC the European Bioinf ormatics Institute. There are no restrictions on its 

CC use by non-profit institutions as long as its content is in no way 

CC modified and this statement is not removed. Usage by and for commercial 

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/ 

CC or send an email to license@isb-sib. ch) . 

CC 

DR EMBL; BX294147; CAD78623.1; -. 

DR HAMAP; MF_01121; -; 1. 

DR PROSITE; PS50305; SIRTUIN; 1. 

KW Hydrolase; NAD. 

FT DOMAIN 1 239 DEACETYLASE SIRTUIN-TYPE . 

FT DOMAIN 8 21 NAD BINDING (BY SIMILARITY) . 

FT ACT_SITE 111 111 BY SIMILARITY. 

SQ SEQUENCE 239 AA; 26142 MW; CB3FE5A5D3 8 5DDEF CRC64 ; 



Query Match 



62.5%; Score 5; DB 1; Length 239; 



Best Local Similarity 100.0%; Pred. No. 90; 

Matches 5; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 



Qy 1 EVNLD 5 

Mill 

Db 205 EVNLD 209 



RESULT 35 
Y514_METJA 

ID Y514_METJA STANDARD; PRT; 250 AA. 

AC Q57934; 

DT 10-OCT-2003 {Rel. 42, Created) 

DT 10-OCT-2003 (Rel. 42, Last sequence update) 

DT 10-OCT-2003 (Rel. 42, Last annotation update) 

DE Hypothetical poly f erredoxin-like protein MJ0514. 

GN MJ0514. 

OS Methanococcus jannaschii. 

OC Archaea; Euryarchaeota ; Methanococci; Methanococcales ; 

OC Methanocaldococcaceae ; Methanocaldococcus . 

OX NCBI_TaxID=2190; 

RN [1] 

RP SEQUENCE FROM N.A. 

RC STRAIN=JAL-1 / DSM 2661 / ATCC 43067; 

RX MEDLINE=96337 999; PubMed-8 68 8 087 ; 

RA Bult C.J., White O., Olsen G.J., Zhou L . , Fleischmann R.D., 

RA Sutton G.G., Blake J. A. , FitzGerald L.M., Clayton R.A. , Gocayne J.D., 

RA Kerlavage A.R., Dougherty B . A. , Tomb J.-F., Adams M.D., Reich C.I., 

RA Overbeek R. , Kirkness E.F., Weinstock K.G., Merrick J.M., Glodek A., 

RA Scott J.L., Geoghagen N.S.M., Weidman J.F., Fuhrmann J.L., Nguyen D., 

RA Utterback T.R., Kelley J.M., Peterson J.D., Sadow P.W., Hanna M.C., 

RA Cotton M.D., Roberts K.M., Hurst M.A. , Kaine B.P., Borodovsky M. , 

RA Klenk H.-P., Fraser CM., Smith H.O., Woese C.R., Venter J.C.; 

RT "Complete genome sequence of the methanogenic archaeon, Methanococcus 

RT jannaschii."; 

RL Science 2 7 3:1058-1073(1996). 

CC -!- SIMILARITY: Belongs to the bacterial- type ferredoxin family. 

CC 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinf ormatics and the EMBL outstation - 

CC the European Bioinf ormatics Institute. There are no restrictions on its 

CC use by non-profit institutions as long as its content is in no way 

CC modified and this statement is not removed. Usage by and for commercial 

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/ 

CC or send an email to license@isb-sib . ch ) . 

cc 

DR EMBL; U67501; AAB98503.1; -. 

DR PIR; B64364; B64364. 

DR HSSP; P00195; 1CLF. 

DR TIGR; MJ0514; -. 

DR InterPro; IPR001450; 4Fe4 S_f erredoxin . 

DR InterPro; IPR000813; 7Fe_f erredoxin . 

DR Pfam; PF00037; fer4; 6. 

DR PRINTS; PR00354; 7FE8SFRDOXIN . 

DR PROSITE; PS00198; 4FE4S_FERREDOXIN ; 6. 

KW Hypothetical protein; Electron transport; Iron-sulfur; 4Fe-4S; 

KW Repeat; Complete proteome. 
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SQ 


SEQUENCE 


250 AA; 


28245 


MW; 5C4D611D53ED2 96 


0 CRC64; 



Query Match 62.5%; Score 5; DB 1; Length 250; 

Best Local Similarity 100.0%; Pred. No. 94; 

Matches 5; Conservative 0; Mismatches 0; Indels 0; Gaps 

Qy 1 EVNLD 5 

Mill 

Db 157 EVNLD 161 



RESULT 3 6 
E1A_ADE41 

ID E1A_ADE4 1 STANDARD; PRT; 251 AA. 

AC P10542; 

DT 01-JUL-1989 (Rel. 11, Created) 

DT 01-JUL-1989 (Rel. 11, Last sequence update) 

DT 10-OCT-2003 (Rel. 42, Last annotation update) 

DE Early ElA 27 kDa protein [Contains: Early ElA 25 kDa protein] . 

OS Human adenovirus type 41. 

OC Viruses; dsDNA viruses, no RNA stage; Adenoviridae; Mas tadenovirus . 

OX NCBI_TaxID=10524; 

RN [1] 

RP SEQUENCE FROM N.A. 

RX MEDLINE=8 8 08 4 4 37; PubMed=2 961652 ; 

RA van Loon A.E., Ligtenberg M. , Reemst A.M.C.B., Sussenbach J.S., 

RA Rozi jn T . H. ; 

RT "Structure and organization of the left-terminal DNA regions of 

RT fastidious adenovirus types 40 and 41."; 

RL Gene 58:109-126(1987). 

RN [2] 

RP SEQUENCE FROM N.A. 

RX MEDLINE-8 82 0 6069; PubMed=2 834 87 1 ; 



RA Allard A. , Wadell G. ; 

RT "Physical organization of the enteric adenovirus type 41 early region 

RT 1A. " ; 

RL Virology 164:22 0-229(1988). 

CC -!- FUNCTION: Trans-activates early viral promoters and some cellular 

CC promoters . 

CC -!- ALTERNATIVE PRODUCTS: 

CC Event=Alternative splicing; Named isoforms=2; 

CC Comment=I so forms are derived from the El region of the genome; 

CC Name=l; Synonyms=Early E1A 27 kDa protein; 

CC IsoId=P10542-l ; Sequence=Di splayed; 

CC Name=2 ; Synonyms=Early ElA 2 5 kDa protein; 

CC IsoId=P10542-2; Sequence=VSP_0002 06; 

CC 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinf ormatics and the EMBL outstation - 

CC the European Bioinf ormatics Institute. There are no restrictions on its 

CC use by non-profit institutions as long as its content is in no way 

CC modified and this statement is not removed. Usage by and for commercial 

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/ 

CC or send an email to license@isb-sib.ch) . 

cc 

DR EMBL; M18289; AAA42450.1; -. 

DR EMBL; M20300; AAA42439.1; 

DR EMBL; M20300; AAA42440.1; -. 

DR PIR; D27333; WMADF4 . 

DR InterPro; IPR003853; Adeno_ElA. 

DR Pfam; PF02703; Adeno_ElA; 1. 

KW Transcription regulation; Activator; Early protein; 

KW Alternative splicing; Zinc-finger; DNA-binding. 

FT ZN_FING 150 170 C4-TYPE . 

FT VARSPLIC 154 181 Missing (in isoform 2) . 

FT / FT I d= VS P_ 0 0 0 2 0 6 . 

SQ SEQUENCE 251 AA; 27476 MW; BA4 8F87 4 87EECE0B CRC64; 

Query Match 62.5%; Score 5; DB 1; Length 251; 
Best Local Similarity 100.0%; Pred. No. 94; 

Matches 5; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 

Qy 1 EVNLD 5 

Mill 

Db 92 EVNLD 96 



RESULT 37 
HEMK BUCBP 



ID 
AC 
DT 
DT 
DT 
DE 
GN 
OS 
OC 
OC 
OX 



HEMK_BUCBP 
Q8 9AT0; 
10-OCT-2003 
10-OCT-2003 
10-OCT-2003 



STANDARD; 



PRT; 



277 AA. 



(Rel. 42, Created) 
(Rel. 42, Last sequence update) 
(Rel. 42, Last annotation update) 
HemK protein homolog (EC 2.1.1.-) . 
HEMK OR BBP162. 

Buchnera aphidicola (subsp. Baizongia pistaciae) . 

Bacteria; Proteobacteria ; Gammaproteobacteria; Enterobacteriales ; 
Enterobacteriaceae; Buchnera. 
NCBI TaxID-135842; 



RN [1] 

RP SEQUENCE FROM N.A. 

RX MEDLINE=22 42 6 901; PubMed=12 5222 65 ; 

RA Van Ham R.C.H.J., Kamerbeek J., Palacios C, Rausell C . , Abascal F., 

RA Bastolla U., Fernandez J.M., Jimenez L . , Postigo M. , Silva F.J., 

RA Tamames J., Viguera E., Latorre A., Valencia A . , Moran F., Moya A,; 

RT "Reductive genome evolution in Buchnera aphidicola . " ; 

RL Proc. Natl. Acad. Sci. U.S.A. 100:581-586(2003). 

CC -!- SIMILARITY: Belongs to the hemK family. 

CC 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinf ormatics and the EMBL outstation - 

CC the European Bioinf ormatics Institute. There are no restrictions on its 

CC use by non-profit institutions as long as its content is in no way 

CC modified and this statement is not removed. Usage by and for commercial 

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/ 

CC or send an email to license@isb-sib . ch ) . 

CC 

DR EMBL; AE014016; AA026895.1; -. 

DR InterPro; IPR002052; N6_Mtase. 

DR InterPro; IPR000051; SAM_bind. 

DR PROSITE; PS00092; N 6_MT AS E ; UNKNOWN_l . 

KW Transferase; Methyltrans f erase ; Complete proteome. 

SQ SEQUENCE 277 AA; 31965 MW; 8 0EC64C95397 5B3F CRC64; 

Query Match 62.5%; Score 5; DB 1; Length 277; 

Best Local Similarity 100.0%; Pred. No. le+02; 

Matches 5; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 

Qy 3 NLDAE 7 

Mill 

Db 22 NLDAE 2 6 



RESULT 3 8 
KPRS METAC 



ID 


KPRS_METAC STANDARD; PRT ; 283 AA. 




AC 


Q8TRK8; 




DT 


28-FEB-2003 (Rel. 41, Created) 




DT 


28-FEB-2003 (Rel. 41, Last sequence update) 




DT 


28-FEB-2003 (Rel. 41, Last annotation update) 




DE 


Ribose-phosphate pyrophosphokinase (EC 2.7.6.1) (RPPK) 


{ Phosphor ibosyl 


DE 


pyrophosphate synthetase) (P-Rib-PP synthetase) (PRPP 


synthetase) . 


GN 


PRS OR MA1167 . 




OS 


Methanosar cina acetivorans . 




OC 


Archaea; Euryarchaeota ; Methanomicrobia ; Methanosar cinales ; 


OC 


Methanosarcinaceae ; Methanosar cina . 




ox 


NCBI TaxID=2214; 




RN 


[1] 




RP 


SEQUENCE FROM N.A. 




RC 


STRAIN-C2A / ATCC 35395 / DSM 2834; 




RX 


MEDLINE=21929760; PubMed=ll 9322 38 ; 




RA 


Galagan J.E., Nusbaum C, Roy A., Endrizzi M.G., Macdonald P., 


RA 


FitzHugh W., Calvo S., Engels R. , Smirnov S., Atnoor D 


. , Brown A. , 


RA 


Allen N., Naylor J., Stange-Thomann N., DeArellano K . , 


Johnson R. , 


RA 


Linton L . , McEwan P., McKernan K., Talamas J., Tirrell 


A. , Ye W. , 


RA 


Zimmer A., Barber R.D., Cann I., Graham D.E., Grahame 


D . A. , Guss A.M., 



RA Hedderich R. , Ingram-Smith C, Kuettner H.C., Krzycki J. A. , 

RA Leigh J. A. , Li W. , Liu J. , Mukhopadhyay B., Reeve J.N., Smith K., 

RA Springer T.A., Umayam L.A., White O . , White R.H., de Macario E.C., 

RA Ferry J.G., Jarrell K.F., Jing H . , Macario A.J.L., Paulsen I., 

RA Pritchett M. , Sowers K.R., Swanson R.V. , Zinder S.H., Lander E., 

RA Metcalf W.W., Birren B.; 

RT "The genome of Methanosarcina acetivorans reveals extensive metabolic 

RT and physiological diversity."; 

RL Genome Res. 12:532-542(2002). 

CC -!- CATALYTIC ACTIVITY: ATP + D-ribose 5-phosphate = AMP + 5-phospho- 
CC alpha-D-ribose 1-diphosphate . 

CC -!- COFACTOR: Binds 1 magnesium ion per subunit (By similarity) . 

CC -!- PATHWAY: Utilized by both the de novo and the salvage pathways by 

CC which endogenously formed or exogenously added pyrimidine, purine, 

CC or pyridine bases are converted to the corresponding 

CC ribonucleoside monophosphates. 

CC -!- SUBCELLULAR LOCATION: Cytoplasmic (By similarity). 

CC -!- SIMILARITY: Belongs to the ribose-phosphate pyrophosphokinase 

CC family. 

CC 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinf ormatics and the EMBL outstation - 

CC the European Bioinf ormatics Institute. There are no restrictions on its 

CC use by non-profit institutions as long as its content is in no way 

CC modified and this statement is not removed. Usage by and for commercial 

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/ 

CC or send an email to license@isb-sib . ch) . 

CC 

DR EMBL; AE010783; AAM0458 8 . 1 ; -. 

DR HAMAP; MF_00583; -; 1. 

DR InterPro; IPR000842; PRPP_synthetase . 

DR InterPro; IPR000836; PRTrans f erase . 

DR InterPro; IPR005946; RibP_Ppkin. 

DR Pfam; PF00156; Pribosyltran; 1. 

DR TIGRFAMs; TIGR01251; ribP_PPkin; 1. 

DR PROSITE; PS00114; PRPP_SYNTHETASE; FALSE_NEG. 

KW Nucleotide biosynthesis; Transferase; Kinase; Magnesium; 

KW Complete proteome. 

FT DOMAIN 201 214 BINDING OF PHOSPHORIBOSYLPYROPHOSPHATE 

FT (POTENTIAL) . 

FT METAL 118 118 MAGNESIUM (POTENTIAL) . 

FT METAL 120 120 MAGNESIUM (POTENTIAL) . 

SQ SEQUENCE 283 AA; 30384 MW; 8 4 8CB5D1C2572 37 F CRC64; 



Query Match 62.5%; Score 5; DB 1; Length 283; 

Best Local Similarity 100.0%; Pred. No. le+02; 

Matches 5; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 



Qy 2 VNLDA 6 

I I I I I 

Db 133 VNLDA 137 



RESULT 3 9 
G3P_LACDT 

ID G3P_LACDT STANDARD; PRT; 2 90 AA. 

AC P55070; 



DT 01-OCT-1996 (Rel. 34, Created) 

DT 01-OCT-1996 (Rel. 34, Last sequence update) 

DT 28-FEB-2003 (Rel. 41, Last annotation update) 

DE Glyceraldehyde 3-phosphate dehydrogenase (EC 1.2.1.12) (GAPDH) 

DE (Fragment) . 

GN GPD . 

OS Lactarius deterrimus . 

OC Eukaryota; Fungi; Basidiomycota; Hymenomycetes ; Homobasidiomycetes ; 

OC Agaricales; Russulaceae; Lactarius. 

OX NCBI_TaxID=36060; 

RN [1] 

RP SEQUENCE FROM N.A. 

RX MEDLINE=96387764; PubMed=8 7 95234 ; 

RA Kreuzinger N . , Podeu R. , Gruber F. , Gobi F., Kubicek CP.; 

RT "Identification of some ectomycorrhizal basidiomycetes by PCR 

RT amplification of their gpd ( glyceraldehyde-3-phosphate dehydrogenase) 

RT genes."; 

RL Appl. Environ. Microbiol. 62:3432-3438(1996). 

CC -!- CATALYTIC ACTIVITY: D-glycer aldehyde 3-phosphate + phosphate + 
CC NAD ( + ) = 3-phospho-D-glyceroyl phosphate + NADH . 

CC -!- PATHWAY: Second phase of glycolysis; first step. 

CC -!- SUBUNIT: Homotetramer (By similarity). 

CC -!- SUBCELLULAR LOCATION: Cytoplasmic. 

CC -!- SIMILARITY: Belongs to the glyceraldehyde 3-phosphate 
CC dehydrogenase family. 

CC 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinf ormatics and the EMBL outstation - 

CC the European Bioinf ormatics Institute. There are no restrictions on its 

CC use by non-profit institutions as long as its content is in no way 

CC modified and this statement is not removed. Usage by and for commercial 

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/ 

CC or send an email to license@isb-sib . ch) . 

CC 

DR EMBL; U30876; AAB38369.1; -. 

DR HSSP; P00354; 3GPD. 

DR InterPro; IPR000173; GAP_dhdrogenase . 

DR Pfam; PF00044; gpdh; 1. 

DR Pfam; PF02800; gpdh_C; 1. 

DR PRINTS; PR0007 8; G3PDHDRGNASE . 

DR PROSITE; PS00071; GAPDH; 1. 

KW Glycolysis; Oxidoreductase; NAD. 

FT NON_TER 1 1 

FT BINDING 130 130 GLYCERALDEHYDE 3-PHOSPHATE. 

FT ACT_SITE 157 157 ACTIVATES THIOL GROUP DURING CATALYSIS . 

FT N0N_TER 29 0 2 90 

SQ SEQUENCE 290 AA; 30768 MW; 4C2E77FAFDBF07CC CRC64; 



Query Match 62.5%; Score 5; DB 1; Length 290; 

Best Local Similarity 100.0%; Pred. No. l.le+02; 

Matches 5; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 



Qy 2 VNLDA 6 

I I I I I 

Db 113 VNLDA 117 



RESULT 4 0 
G3P_BOLED 

ID G3P_BOLED STANDARD; PRT; 291 AA. 

AC Q00301; 

DT 01-NOV-1997 (Rel. 35, Created) 

DT 01-NOV-1997 (Rel. 35, Last sequence update) 

DT 28-FEB-2003 (Rel. 41, Last annotation update) 

DE Glyceraldehyde 3-phosphate dehydrogenase (EC 1.2.1.12) (GAPDH) 

DE (Fragment) . 

GN GPD. 

OS Boletus edulis (King bolete) . 

OC Eukaryota; Fungi; Basidiomycota; Hymenomycetes ; Homobasidiomycetes ; 

OC Boletales; Boletineae; Boletaceae; Boletus. 

OX NCBI_TaxID=36056; 

RN [1] 

RP SEQUENCE FROM N.A. 

RX MEDLINE=96387764 ; PubMed=8795234 ; 

RA Kreuzinger N., Podeu R. , Gruber F. , Gobi F. , Kubicek CP.; 

RT "Identification of some ectomycorrhizal basidiomycetes by PCR 

RT amplification of their gpd ( glyceraldehyde-3-phosphate dehydrogenase) 

RT genes."; 

RL Appl. Environ. Microbiol. 62:3432-3438(1996). 

CC -!- CATALYTIC ACTIVITY: D-glyceraldehyde 3-phosphate + phosphate + 
CC NAD ( + ) = 3-phospho-D-glyceroyl phosphate + NADH . 

CC -!- PATHWAY: Second phase of glycolysis; first step. 

CC -!- SUBUNIT: Homotetramer (By similarity). 

CC -!- SUBCELLULAR LOCATION: Cytoplasmic. 

CC -!- SIMILARITY: Belongs to the glyceraldehyde 3-phosphate 
CC dehydrogenase family. 

CC 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinf ormatics and the EMBL outstation - 

CC the European Bioinf ormatics Institute. There are no restrictions on its 

CC use by non-profit institutions as long as its content is in no way 

CC modified and this statement is not removed. Usage by and for commercial 

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/ 

CC or send an email to license@isb-sib.ch) . 

CC 

DR EMBL; U30625; AAB38246.1; -. 

DR HSSP; P00357; 4GPD. 

DR InterPro; IPR000173; GAP_dhdrogenase. 

DR InterPro; IPR006424; GAPDH- I . 

DR Pfam; PF00044; gpdh; 1. 

DR Pfam; PF02 8 00; gpdh_C; 1. 

DR PRINTS; PR00078; G3PDHDRGNASE . 

DR TIGRFAMs; TIGR01534; GAPDH- I; 1. 

DR PROSITE; PS00071; GAPDH; 1. 

KW Glycolysis; Oxidoreductase ; NAD. 

FT NON_TER 1 1 

FT BINDING 130 130 GLYCERALDEHYDE 3-PHOSPHATE. 

FT ACT_SITE 157 157 ACTIVATES THIOL GROUP DURING CATALYSIS. 

FT NONJTER 291 2 91 

SQ SEQUENCE 291 AA; 30895 MW; B497315405D6D287 CRC64; 

Query Match 62.5%; Score 5; DB 1; Length 291; 

Best Local Similarity 100.0%; Pred. No. l.le+02; 

Matches 5; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 



Qy 2 VNLDA 6 

Mill 

Db 113 VNLDA 117 

Search completed: March 26, 2004, 15:33:43 
Job time : 21.5 sees 
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ALIGNMENTS 



RESULT 1 

US- 08- 8 02-98 1-2 19 

; Sequence 219, Application US/08802981 

; Patent No. 6037137 

; GENERAL INFORMATION: 

; APPLICANT: Komoriya, Akira 

APPLICANT: Packard, Beverly S. 
; TITLE OF INVENTION: Compositions for the Detection of Enzyme 

TITLE OF INVENTION: Activity in Biological Samples and Methods of Use 



NUMBER OF SEQUENCES: 231 
CORRESPONDENCE ADDRESS : 

ADDRESSEE: Townsend and Townsend and Crew LLP 

STREET: Two Embarcadero Center, Eighth Floor 

CITY: San Francisco 

STATE: California 

COUNTRY: USA 

ZIP: 94111-3834 
COMPUTER READABLE FORM: 

MEDIUM TYPE: Floppy disk 

COMPUTER: IBM PC compatible 

OPERATING SYSTEM: PC-DOS/MS-DOS 

SOFTWARE: Patentln Release #1.0, Version #1.30 
CURRENT APPLICATION DATA: 

APPLICATION NUMBER: US/ 0 8 / 8 02 , 9 8 1 

FILING DATE: 20-FEB-1997 

CLASSIFICATION: 435 
ATTORNEY/ AGENT INFORMATION: 

NAME : Hunter, Tom 

REGISTRATION NUMBER: 38,4 98 

REFERENCE/DOCKET NUMBER: 0 1 68 65- 0 0030 0US 
TELECOMMUNICATION INFORMATION: 



Thereof 



TELEPHONE: (415) 576-0200 
; TELEFAX: (415) 576-0300 

INFORMATION FOR SEQ ID NO: 219: 
SEQUENCE CHARACTERISTICS: 
; LENGTH: 9 amino acids 

TYPE: amino acid 
; STRANDEDNESS : 

TOPOLOGY: linear 
MOLECULE TYPE: peptide 
US-08-802-981-219 

Query Match 100.0%; Score 8; DB 3 ; Length 9; 

Best Local Similarity 100.0%; Pred. No. 3e+05; 

Matches 8; Conservative 0; Mismatches 0; Indels 

Qy 1 EVNLDAEF 8 

I I I t I I I I 
Db 2 EVNLDAEF 9 



RESULT 2 

US-09-72 4-566A-52 

; Sequence 52, Application US/09724566A 
; Patent No. 6627739 
; GENERAL INFORMATION: 



APPLICANT 
APPLICANT 
APPLICANT 
APPLICANT 
APPLICANT 
APPLICANT 
APPLICANT 
APPLICANT 
APPLICANT 
APPLICANT 
APPLICANT 
TITLE OF 



Anderson, John P. 
Basi, Guriqbal 
Doane, Minh Tarn 
Frigon, No. 6627739mand 
John, Varghese 
Power, Michael 
Sinha, Sukanto 
Tatsuno, Gwen 
Tung, Jay 
Wang, Shuwen 
McConlogue, Lisa 
INVENTION: Beta-Secretase Enzyme Compositions and 



TITLE OF INVENTION: Methods 
FILE REFERENCE: 22 8-US-NEWC2 

CURRENT APPLICATION NUMBER: US/ 09/724, 566A 
CURRENT FILING DATE: 2000-11-28 
PRIOR APPLICATION NUMBER: US 09/501,708 

2000-02-10 
NUMBER: 60/119,571 

1999-02-10 
NUMBER: 60/139, 172 
1999-06-15 
SEQ ID NOS: 104 
FastSEQ for Windows Version 



PRIOR FILING DATE 
PRIOR APPLICATION 
PRIOR FILING DATE 
PRIOR APPLICATION 
PRIOR FILING DATE 
NUMBER OF 
SOFTWARE : 
SEQ ID NO 52 

LENGTH: 9 

TYPE: PRT 

ORGANISM: Artificial 
FEATURE: 



4 . 0 



Sequence 



OTHER INFORMATION: Synthetic oligopeptide substrate 
US-09-724-566A-52 



Query Match 100.0%; Score 8; DB 4; Length 9; 

Best Local Similarity 100.0%; Pred. No. 3e+05; 

Matches 8; Conservative 0; Mismatches 0; Indels 

Qy 1 EVNLDAEF 8 

I I I I I I I 1 
Db 2 EVNLDAEF 9 



RESULT 3 

US-09-724-566A-8 3 

; Sequence 83, Application US/09724566A 
; Patent No. 6627739 
; GENERAL INFORMATION: 



APPLICANT 


: Anderson, John P. 


APPLICANT 


: Basi, Guriqbal 


APPLICANT 


: Doane, Minh Tarn 


APPLICANT: 


: Frigon, No. 6627739mand 


APPLICANT : 


John, Varghese 


APPLICANT : 


Power, Michael 


APPLICANT: 


Sinha, Sukanto 


APPLICANT: 


Tatsuno, Gwen 


APPLICANT : 


Tung, Jay 


APPLICANT: 


Wang, Shuwen 


APPLICANT : 


McConlogue, Lisa 



TITLE OF INVENTION: Beta-Secretase Enzyme Compositions and 

TITLE OF INVENTION: Methods 
; FILE REFERENCE: 228-US-NEWC2 

CURRENT APPLICATION NUMBER: US/ 0 9 / 72 4 , 5 6 6A 

CURRENT FILING DATE: 2 000-11-2 8 

PRIOR APPLICATION NUMBER: US 09/501,708 

PRIOR FILING DATE: 2000-02-10 

PRIOR APPLICATION NUMBER: 60/119,571 

PRIOR FILING DATE: 1999-02-10 

PRIOR APPLICATION NUMBER: 60/139,172 
; PRIOR FILING DATE: 1999-06-15 

NUMBER OF SEQ ID NOS : 104 
; SOFTWARE: FastSEQ for Windows Version 4.0 
; SEQ ID NO 83 

LENGTH : 9 
; TYPE: PRT 

; ORGANISM: Homo sapiens 
FEATURE: 

OTHER INFORMATION: APP fragment P5-P4'wt 
US-09-724-566A-83 

Query Match 100.0%; Score 8; DB 4; Length 9; 

Best Local Similarity 100.0%; Pred. No. 3e+05; 

Matches 8; Conservative 0; Mismatches 0; Indels 

Qy 1 EVNLDAEF 8 

I I I I I I I I 
Db 2 EVNLDAEF 9 



RESULT 4 

US-08-659-984A-19 



; Sequence 19, Application US/08659984A 

; Patent No. 5942400 

; GENERAL INFORMATION: 

; APPLICANT: Anderson, John P. 

APPLICANT: Sinha, Sukanto 

APPLICANT: Jacobson-Croak, Kirs ten L. 
; TITLE OF INVENTION: Assays for Detecting Beta-Secretase 

TITLE OF INVENTION: Inhibition 
; NUMBER OF SEQUENCES : 21 

; CORRESPONDENCE ADDRESS: 

; ADDRESSEE: Townsend and Townsend and Crew LLP 

; STREET: Two Embarcadero Ctr. f 8th Floor 

; CITY: San Francisco 

; STATE: California 

; COUNTRY: USA 

ZIP: 94111-3834 
COMPUTER READABLE FORM: 
; MEDIUM TYPE: Floppy disk 

COMPUTER: IBM PC compatible 

OPERATING SYSTEM: PC-DOS /MS-DOS 

SOFTWARE: Patentln Release #1.0, Version #1.25 
CURRENT APPLICATION DATA: 

APPLICATION NUMBER: US/08/659, 984A 
FILING DATE: 07-JUN-1996 
CLASSIFICATION: 436 
; PRIOR APPLICATION DATA: 

•; APPLICATION NUMBER: US 08/485, 152 

FILING DATE: 07-JUN-1995 
ATTORNEY/AGENT INFORMATION: 
; NAME: Heslin, James M. 

REGISTRATION NUMBER: 29,541 
; REFERENCE/DOCKET NUMBER: 1 52 7 0- 0 02 8 1 OUS 

TELECOMMUNICATION INFORMATION: 

TELEPHONE: 415-326-2400 
; TELEFAX: 415-326-2422 

; INFORMATION FOR SEQ ID NO: 19: 
; SEQUENCE CHARACTERISTICS: 

; LENGTH: 10 amino acids 

; TYPE : amino acid 

; STRANDEDNESS : single 

; TOPOLOGY: linear 

MOLECULE TYPE: peptide 
; FEATURE : 

NAME/ KEY : Regi on 
; LOCATION: one-of(l) 

OTHER INFORMATION: /note= "N- terminal Ser is acetylated. 
US-08-659-9 8 4A-19 

Query Match 100.0%; Score 8; DB 2; Length 10; 

Best Local Similarity 100.0%; Pred. No. 0.004; 

Matches 8; Conservative 0; Mismatches 0; Indels 

Qy 1 EVNLDAEF 8 

I I I I I I I I 
Db 2 EVNLDAEF 9 



RESULT 5 

US-08-660-531-19 

Sequence 19, Application US/08660531 
Patent No. 6221645 
GENERAL INFORMATION: 



APPLICANT 
APPLICANT 
APPLICANT 
APPLICANT 
TITLE OF 



Chrysler, Susanna M.S. 
Sinha, Sukanto 
Keim, Pamela S . 
Anderson, John P . 
INVENTION : Beta-Secretase 



NUMBER OF SEQUENCES: 21 
CORRESPONDENCE ADDRESS: 

ADDRESSEE: Townsend and Townsend and Crew LLP 
STREET: Two Embarcadero Ctr., 8th Floor 
CITY: San Francisco 
STATE: California 
COUNTRY: USA 
ZIP: 94111-3834 
COMPUTER READABLE FORM: 

MEDIUM TYPE: Floppy disk 
COMPUTER: IBM PC compatible 
OPERATING SYSTEM: PC-DOS/MS-DOS 

SOFTWARE: Patentln Release #1.0, Version #1.25 
CURRENT APPLICATION DATA: 

APPLICATION NUMBER: US/08/660, 531 
FILING DATE: 
CLASSIFICATION: 435 
PRIOR APPLICATION DATA: 

APPLICATION NUMBER: US 08/480,498 
FILING DATE: 07-JUN-1995 
ATTORNEY/ AGENT INFORMATION: 
NAME: Heslin, James M. 
REGISTRATION NUMBER: 29,541 
REFERENCE/DOCKET NUMBER: 152 7 0- 0 022 1 0US 
TELECOMMUNICATION INFORMATION: 
TELEPHONE: 415-32 6-2 4 00 
TELEFAX : 415-32 6-2422 
INFORMATION FOR SEQ ID NO: 19: 
SEQUENCE CHARACTERISTICS: 
LENGTH: 10 amino acids 
TYPE: amino acid 
STRANDEDNESS : single 
TOPOLOGY: linear 
MOLECULE TYPE: peptide 
FEATURE: 

NAME/KEY : Region 
LOCATION : one-of ( 1 ) 
OTHER INFORMATION: /note^ 
US-08-660-531-19 



tt 



N-terminal Ser is acetylated." 



Query Match 100.0%; Score 8; DB 3; Length 10; 

Best Local Similarity 100.0%; Pred. No. 0.004; 

Matches 8; Conservative 0; Mismatches 0; Indels 



0; Gaps 



0; 



Qy 

Db 



1 EVNLDAEF 8 



2 EVNLDAEF 9 



RESULT 6 

US- 0 9-54 8-3 72D- 63 

; Sequence 63, Application US/09548372D 
; Patent No. 6420534 
; GENERAL INFORMATION: 

APPLICANT: GURNEY ET AL . 

TITLE OF INVENTION: ALZHEIMER'S DISEASE SECRETASE, APP SUBSTRATES THEREFOR 
AND USES 

; TITLE OF INVENTION: THEREOF 
; FILE REFERENCE: 29915/62801 

CURRENT APPLICATION NUMBER: US/ 09/ 54 8 , 372D 

CURRENT FILING DATE: 2000-04-12 
; PRIOR APPLICATION NUMBER: US 60/155,493 
; PRIOR FILING DATE: 1999-09-23 
; PRIOR APPLICATION NUMBER: US 09/404,133 

PRIOR FILING DATE: 1999-09-23 

PRIOR APPLICATION NUMBER: PCT/US99/2 08 8 1 
; PRIOR FILING DATE: 1999-09-23 
; PRIOR APPLICATION NUMBER: US 60/101,594 
; PRIOR FILING DATE: 1998-09-24 

NUMBER OF SEQ ID NOS : 7 3 
; SOFTWARE: Patent In version 3.1 
; SEQ ID NO 63 
; LENGTH: 10 
; TYPE: PRT 

; ORGANISM: Artificial sequence 
FEATURE : 

; OTHER INFORMATION: Synthetic peptide 
US- 0 9- 54 8 -37 2D- 63 

Query Match 100.0%; Score 8; DB 4; Length 10; 

Best Local Similarity 100.0%; Pred. No. 0.004; 

Matches 8; Conservative 0; Mismatches 0; Indels 0; Gaps 0 

Qy 1 EVNLDAEF 8 

I I I I I i I I 
Db 2 EVNLDAEF 9 



RESULT 7 

US- 0 9-54 8-3 67D- 63 

; Sequence 63, Application US/09548367D 

; Patent No. 6440698 

; GENERAL INFORMATION: 

; APPLICANT: GURNEY ET AL . 

; TITLE OF INVENTION: ALZHEIMER'S DISEASE SECRETASE, APP SUBSTRATES THEREFOR 
AND USES 

; TITLE OF INVENTION: THEREOF 

FILE REFERENCE: 29915/6280H 
; CURRENT APPLICATION NUMBER: US / 0 9/ 54 8 , 3 67 D 
; CURRENT FILING DATE: 2000-04-12 

PRIOR APPLICATION NUMBER: US 60/155,493 

PRIOR FILING DATE: 1999-09-23 

PRIOR APPLICATION NUMBER: US 09/404,133 

PRIOR FILING DATE: 1999-09-23 



PRIOR APPLICATION NUMBER: PCT/US 99/2 08 8 1 

PRIOR FILING DATE: 1999-09-23 
; PRIOR APPLICATION NUMBER: US 60/101,594 

PRIOR FILING DATE: 1998-09-24 
; NUMBER OF SEQ ID NOS : 73 

SOFTWARE: Patentln version 3.1 
; SEQ ID NO 63 
LENGTH: 10 
; TYPE: PRT 

; ORGANISM: Artificial sequence 
FEATURE : 

OTHER INFORMATION: Synthetic peptide 
US-09-548-367D-63 

Query Match 100.0%; Score 8; DB 4; Length 10; 

Best Local Similarity 100.0%; Pred. No. 0.004; 

Matches 8; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 

Qy 1 EVNLDAEF 8 

I I I I I I I I 
Db 2 EVNLDAEF 9 



RESULT 8 

US-0 9-551-853D-63 

; Sequence 63, Application US/09551853D 
; Patent No. 6500667 
; GENERAL INFORMATION: 

APPLICANT: GURNEY ET AL . 

TITLE OF INVENTION: ALZHEIMER'S DISEASE SECRETASE, APP SUBSTRATES THEREFOR 
AND USES 

TITLE OF INVENTION: THEREOF 

FILE REFERENCE: 29915/6280L 
; CURRENT APPLICATION NUMBER: US / 0 9 / 55 1 , 8 5 3D 

CURRENT FILING DATE: 2000-04-18 

PRIOR APPLICATION NUMBER: US 60/155,493 

PRIOR FILING DATE: 1999-09-23 

PRIOR APPLICATION NUMBER: US 09/404,133 

PRIOR FILING DATE: 1999-09-23 

PRIOR APPLICATION NUMBER: PCT/US 9 9/2 0 8 8 1 

PRIOR FILING DATE: 1999-09-23 
; PRIOR APPLICATION NUMBER: US 60/101,594 

PRIOR FILING DATE: 1998-09-24 
; NUMBER OF SEQ ID NOS: 73 

SOFTWARE: Patentln version 3.1 
; SEQ ID NO 63 
LENGTH: 10 
TYPE: PRT 

ORGANISM: Artificial sequence 
; FEATURE : 

OTHER INFORMATION: Synthetic peptide 
US-09-551-853D-63 

Query Match 100.0%; Score 8; DB 4; Length 10; 

Best Local Similarity 100.0%; Pred. No. 0.004; 

Matches 8; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 



Qy 1 EVNLDAEF 8 

I I ! 1 IE I I 

Db 2 EVNLDAEF 9 



RESULT 9 
US-09-604-608-5 

; Sequence 5, Application US/09604608 

; Patent No. 6545127 

; GENERAL INFORMATION: 

; APPLICANT: Tang, Jordan J.N. 

APPLICANT: Lin, Xinli 

APPLICANT: Koelsch, Gerald 

TITLE OF INVENTION: Catalyti cally Active Recombinant Memapsin and Methods 
; TITLE OF INVENTION: of Use Thereof 
FILE REFERENCE: OMRF 17 9 

CURRENT APPLICATION NUMBER: US/ 09/ 604 , 60 8 
; CURRENT FILING DATE: 2000-06-27 
; PRIOR APPLICATION NUMBER: 60/141,363 

PRIOR FILING DATE: 1999-06-28 

PRIOR APPLICATION NUMBER: 60/168,060 

PRIOR FILING DATE: 1999-11-30 
; PRIOR APPLICATION NUMBER: 60/177,836 
; PRIOR FILING DATE: 2000-01-25 

PRIOR APPLICATION NUMBER: 60/178,368 

PRIOR FILING DATE: 2000-01-27 

PRIOR APPLICATION NUMBER: 60/210,292 

PRIOR FILING DATE: 2000-06-08 

NUMBER OF SEQ ID NOS : 31 

SOFTWARE: Patentln Ver. 2.1 
; SEQ ID NO 5 
; LENGTH: 10 
TYPE: PRT 

ORGANISM: Artificial Sequence 
FEATURE : 

OTHER INFORMATION: Description of Artificial Sequence: Synthetic 
OTHER INFORMATION: Peptide 
US-09-604-608-5 

Query Match 100.0%; Score 8; DB 4; Length 10; 

Best Local Similarity 100.0%; Pred. No. 0.004; 

Matches 8; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 

Qy 1 EVNLDAEF 8 

I I I I I I M 
Db 2 EVNLDAEF 9 



RESULT 10 
PCT-US94-07 043A-3 

; Sequence 3, Application PC/TUS94 07 04 3A 

GENERAL INFORMATION: 
; APPLICANT: Tamburini, Paul P.; Benz, G nter; H bich, 

; APPLICANT: Dieter; Dreyer, Robert N . ; Koenig, Gerhard 

TITLE OF INVENTION: CATHEPSIN D IS AN AMYLOIDOGENIC 
TITLE OF INVENTION: PROTEASE IN ALZHEIMER S DISEASE 
NUMBER OF SEQUENCES: 11 



; CORRESPONDENCE ADDRESS: 

; ADDRESSEE: Miles Inc. 

STREET: 400 Morgan Lane 

CITY: West Haven 
; STATE : Connecticut 

COUNTRY: USA 

ZIP: 06516 
COMPUTER READABLE FORM: 
; MEDIUM TYPE: Diskette, 3.50 inch, 800 kb storage 

; COMPUTER: Sharp PC 4600 

OPERATING SYSTEM: MS-DOS 

SOFTWARE: WordPerfect 5.1 
CURRENT APPLICATION DATA: 
; APPLICATION NUMBER: PCT/US94/07043A 

FILING DATE: June 21, 1994 

CLASSIFICATION: 
PRIOR APPLICATION DATA: 

APPLICATION NUMBER: PCT/US93/108 8 9 

FILING DATE: November 12, 1993 
; PRIOR APPLICATION DATA: 

APPLICATION NUMBER: 07/995,660 

FILING DATE: December 16, 1992 
PRIOR APPLICATION DATA: 

APPLICATION NUMBER: 07/880,914 
; FILING DATE: May 11, 1992 

; ATTORNEY/AGENT INFORMATION: 

; NAME: Pamela A. Simonton 

REGISTRATION NUMBER: 31,060 

REFERENCE/DOCKET NUMBER: MTI 224.3 
TELECOMMUNICATION INFORMATION: 

TELEPHONE: (203) 937-2340 

TELEFAX : (203) 937-2795 
INFORMATION FOR SEQ ID NO: 3: 
; SEQUENCE CHARACTERISTICS: 

LENGTH: 11 amino acids 
; TYPE: amino acid 

TOPOLOGY: linear 
PCT-US94-07 04 3A-3 

Query Match 100.0%; Score 8; DB 5; Length 11; 

Best Local Similarity 100.0%; Pred. No. 0.0044; 

Matches 8; Conservative 0; Mismatches 0; Indels 

Qy 1 EVNLDAEF 8 

I ! I I I I I I 

Db 3 EVNLDAEF 10 



RESULT 11 
US-08-659-9 8 4A-18 

; Sequence 18, Application US/08659984A 

; Patent No. 5942400 

; GENERAL INFORMATION : 

; APPLICANT: Anderson, John P. 

; APPLICANT: Sinha, Sukanto 

APPLICANT: Jacobson-Croak, Kirsten L. 

TITLE OF INVENTION: Assays for Detecting Beta-Secretase 



; TITLE OF INVENTION: Inhibition 

; NUMBER OF SEQUENCES: 21 

CORRESPONDENCE ADDRESS: 

ADDRESSEE: Townsend and Townsend and Crew LLP 
; STREET: Two Embarcadero Ctr., 8th Floor 

; CITY: San Francisco 

; STATE: California 

COUNTRY: USA 
; ZIP: 94111-3834 

; COMPUTER READABLE FORM: 

MEDIUM TYPE: Floppy disk 

COMPUTER: IBM PC compatible 

OPERATING SYSTEM: PC-DOS/MS-DOS 

SOFTWARE: Patentln Release #1.0, Version #1.25 
; CURRENT APPLICATION DATA: 

APPLICATION NUMBER: US/ 0 8/659, 984A 

FILING DATE: 07-JUN-1996 
; CLASSIFICATION: 436 

; PRIOR APPLICATION DATA: 

; APPLICATION NUMBER: US 08/485,152 

; FILING DATE: 07-JUN-1995 

; ATTORNEY/AGENT INFORMATION: 

; NAME: Heslin, James M. 

REGISTRATION NUMBER: 2 9,541 

REFERENCE/ DOCKET NUMBER: 152 7 0- 0 02 8 1 OUS 
TELECOMMUNICATION INFORMATION: 

TELEPHONE: 415-326-2400 
; TELEFAX: 415-326-2422 

INFORMATION FOR SEQ ID NO: 18: 
SEQUENCE CHARACTERISTICS: 
; LENGTH: 21 amino acids 

; TYPE: amino acid 

STRANDEDNESS : single 
; TOPOLOGY: linear 

; MOLECULE TYPE: peptide 

US-08-659-984A-18 

Query Match 100.0%; Score 8; DB 2; Length 21; 

Best Local Similarity 100.0%; Pred. No. 0.0081; 

Matches 8; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 

Qy 1 EVNLDAEF 8 

MINIM 

Db 2 EVNLDAEF 9 



RESULT 12 

US-08-802-981-112 

; Sequence 112, Application US/08802981 
; Patent No. 6037137 

GENERAL INFORMATION: 

APPLICANT: Komoriya, Akira 
; APPLICANT: Packard, Beverly S. 

TITLE OF INVENTION: Compositions for the Detection of Enzyme 

TITLE OF INVENTION: Activity in Biological Samples and Methods of Use 
Thereof 

NUMBER OF SEQUENCES: 231 



CORRESPONDENCE ADDRESS: 

ADDRESSEE: Townsend and Townsend and Crew LLP 
; STREET: Two Embarcadero Center, Eighth Floor 

; CITY: San Francisco 

; STATE: California 

COUNTRY: USA 

ZIP: 94111-3834 
COMPUTER READABLE FORM: 

MEDIUM TYPE: Floppy disk 

COMPUTER: IBM PC compatible 

OPERATING SYSTEM: PC-DOS/MS-DOS 

SOFTWARE: Patentln Release #1.0, Version #1.30 
; CURRENT APPLICATION DATA: 

; APPLICATION NUMBER: US/ 0 8/ 8 02 , 98 1 

FILING DATE: 20-FEB-1997 
; CLASSIFICATION: 435 

ATTORNEY/AGENT INFORMATION: 

NAME: Hunter, Tom 

REGISTRATION NUMBER: 3 8,4 98 

REFERENCE/DOCKET NUMBER: 0168 65- 00 0300US 
TELECOMMUNICATION INFORMATION: 
TELEPHONE: {415} 576-0200 
TELEFAX: (415) 576-0300 
INFORMATION FOR SEQ ID NO: 112: 
SEQUENCE CHARACTERISTICS: 
LENGTH: 21 amino acids 
; TYPE: amino acid 

STRANDEDNESS : 
TOPOLOGY: linear 
MOLECULE TYPE: peptide 
; FEATURE : 

; NAME/KEY: Modi fied- site 

; LOCATION: 3 

OTHER INFORMATION: /product= "Aib" 
FEATURE: 

; NAME/KEY: Modif ied-site 

; LOCATION: 4 

OTHER INFORMATION: /product^ "Acp" 
; FEATURE : 

NAME/ KEY : Modif ied-site 
; LOCATION: 16 

OTHER INFORMATION: /product= "Acp" 
US-08-802-981-112 

Query Match 100.0%; Score 8; DB 3; Length 21; 

Best Local Similarity 100.0%; Pred. No. 0.0081; 

Matches 8; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 

Qy 1 EVNLDAEF 8 

MINIM 
Db 7 EVNLDAEF 14 



RESULT 13 
US-08-660-531-18 

; Sequence 18, Application US/08660531 
; Patent No. 6221645 



GENERAL INFORMATION 



APPLICANT 
APPLICANT 
APPLICANT 
APPLICANT 



Chrysler, Susanna M. S . 
Sinha, Sukanto 
Keim, Pamela S . 
Anderson, John P. 
TITLE OF INVENTION: Beta-Secretase 
NUMBER OF SEQUENCES: 21 
CORRESPONDENCE ADDRESS: 

ADDRESSEE: Townsend and Townsend and Crew LLP 
STREET: Two Embarcadero Ctr., 8th Floor 
CITY: San Francisco 
STATE: California 
COUNTRY: USA 
ZIP: 94111-3834 
COMPUTER READABLE FORM: 

MEDIUM TYPE: Floppy disk 
COMPUTER: IBM PC compatible 
OPERATING SYSTEM: PC-DOS /MS-DOS 

SOFTWARE: Patentln Release #1.0, Version #1.25 
CURRENT APPLICATION DATA: 

APPLICATION NUMBER: US/08/660,531 
FILING DATE: 
CLASSIFICATION: 435 
PRIOR APPLICATION DATA: 

APPLICATION NUMBER: US 08/480,498 
FILING DATE: 07-JUN-1995 
ATTORNEY/AGENT INFORMATION: 
NAME: Heslin, James M. 
REGISTRATION NUMBER: 2 9,541 
REFERENCE/DOCKET NUMBER: 1 52 7 0- 0 022 1 OUS 
TELECOMMUNICATION INFORMATION: 
TELEPHONE: 415-32 6-2 4 00 
TELEFAX: 415-32 6-2422 
INFORMATION FOR SEQ ID NO: 18: 
SEQUENCE CHARACTERISTICS: 
LENGTH: 21 amino acids 
TYPE: amino acid 
STRANDEDNESS : single 
TOPOLOGY: linear 
MOLECULE TYPE: peptide 
US-08-660-531-18 

Query Match 100.0%; Score 8; DB 3; Length 21; 

Best Local Similarity 100.0%; Pred. No. 0.0081; 

Matches 8; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 

Qy 1 EVNLDAEF 8 

I I I I i I I I 
Db 2 EVNLDAEF 9 



RESULT 14 
US-08-659-984A-17 

; Sequence 17, Application US/08659984A 
; Patent No. 5942400 

GENERAL INFORMATION: 
; APPLICANT: Anderson, John P. 



APPLICANT: Sinha, Sukanto 
; APPLICANT: Jacobson-Croak, Kirs ten L. 

; TITLE OF INVENTION: Assays for Detecting Beta-Secretase 

TITLE OF INVENTION: Inhibition 

NUMBER OF SEQUENCES: 21 
; CORRESPONDENCE ADDRESS : 

; ADDRESSEE: Townsend and Towns end and Crew LLP 

; STREET: Two Embarcadero Ctr. , 8th Floor 

; CITY: San Francisco 

STATE: California 

COUNTRY: USA 

ZIP: 94111-3834 
; COMPUTER READABLE FORM: 

; MEDIUM TYPE: Floppy disk 

COMPUTER: IBM PC compatible 
; OPERATING SYSTEM: PC-DOS/MS-DOS 

SOFTWARE: Patentln Release #1.0, Version #1.25 
CURRENT APPLICATION DATA: 

APPLICATION NUMBER: US/ 08/ 659, 98 4 A 

FILING DATE: 07-JUN-1996 

CLASSIFICATION: 436 
PRIOR APPLICATION DATA: 

APPLICATION NUMBER: US 08/485,152 
; FILING DATE: 07-JUN-1995 

ATTORNEY/AGENT INFORMATION : 
; NAME: Heslin, James M. 

REGISTRATION NUMBER: 29,541 
; REFERENCE/DOCKET NUMBER: 152 7 0- 0 02 8 1 0US 

TELECOMMUNICATION INFORMATION: 

TELEPHONE: 415-326-2400 
; TELEFAX: 415-326-2422 

INFORMATION FOR SEQ ID NO: 17: 
; SEQUENCE CHARACTERISTICS: 

; LENGTH: 3 0 amino acids 

; TYPE: amino acid 

STRANDEDNESS : single 
; TOPOLOGY: linear 

MOLECULE TYPE: peptide 
US-08-659-984A-17 

Query Match 100.0%; Score 8; DB 2 ; Length 30; 

Best Local Similarity 100.0%; Pred. No. 0.011; 

Matches 8; Conservative 0; Mismatches 0; Indels 

Qy 1 EVNLDAEF 8 

MINIM 

Db 23 EVNLDAEF 3 0 



RESULT 15 
US-08-660-531-17 

; Sequence 17, Application US/08660531 

; Patent No. 6221645 

GENERAL INFORMATION: 

APPLICANT: Chrysler, Susanna M.S. 
APPLICANT: Sinha, Sukanto 
APPLICANT: Keim, Pamela S. 



; APPLICANT: Anderson, John P. 

; TITLE OF INVENTION: Beta-Secretase 

; NUMBER OF SEQUENCES : 21 

; CORRESPONDENCE ADDRESS: 

; ADDRESSEE: Townsend and Townsend and Crew LLP 

STREET: Two Embarcadero Ctr., 8th Floor 

; CITY: San Francisco 

STATE: California 

; COUNTRY: USA 

; ZIP: 94111-3834 

; COMPUTER READABLE FORM: 

; MEDIUM TYPE: Floppy disk 

; COMPUTER: IBM PC compatible 

; OPERATING SYSTEM: PC-DOS /MS-DOS 

; SOFTWARE: Patentln Release #1.0, Version #1.25 

CURRENT APPLICATION DATA: 
; APPLICATION NUMBER: US/ 0 8 / 66 0 , 53 1 

FILING DATE: 
CLASSIFICATION: 435 
; PRIOR APPLICATION DATA: 

APPLICATION NUMBER: US 08/480,498 
FILING DATE: 07-JUN-1995 
ATTORNEY/AGENT INFORMATION: 
; NAME: Heslin, James M. 

REGISTRATION NUMBER: 29,541 
REFERENCE/DOCKET NUMBER: 152 7 0- 0 022 1 OUS 
TELECOMMUNICATION INFORMATION: 
TELEPHONE: 415-326-2400 
TELEFAX: 415-326-2422 
INFORMATION FOR SEQ ID NO: 17: 
SEQUENCE CHARACTERISTICS: 
; LENGTH : 30 amino acids 

; TYPE: amino acid 

; STRANDEDNESS : single 

; TOPOLOGY: linear 

; MOLECULE TYPE: peptide 

US-08-660-531-17 

Query Match 100.0%; Score 8; DB 3; Length 30; 

Best Local Similarity 100.0%; Pred. No. 0.011; 

Matches 8; Conservative 0; Mismatches 0; Indels 

Qy 1 EVNLDAEF 8 

I I I I II I I 
Db 2 3 EVNLDAEF 30 



RESULT 16 
US-09-724-566A-53 

Sequence 53, Application US/09724566A 
Patent No. 6627739 
GENERAL INFORMATION: 



APPLICANT 
APPLICANT 
APPLICANT 
APPLICANT 
APPLICANT 



Anderson, John P. 
Basi, Guriqbal 
Doane, Minh Tarn 
Frigon, No. 6627739mand 
John, Varghese 



APPLICANT 
APPLICANT 
APPLICANT 
APPLICANT 
APPLICANT 
APPLICANT 



Power, Michael 
Sinha, Sukanto 
Tatsuno, Gwen 
Tung, Jay 
Wang, Shuwen 
McConlogue, Lisa 
TITLE OF INVENTION: Beta-Secretase Enzyme Compositions and 
TITLE OF INVENTION: Methods 
FILE REFERENCE: 228-US-NEWC2 

CURRENT APPLICATION NUMBER: US/09/724 , 566A 
CURRENT FILING DATE: 2000-11-28 
PRIOR APPLICATION NUMBER: US 09/501,708 
PRIOR FILING DATE: 2000-02-10 
PRIOR APPLICATION NUMBER: 60/119,571 
PRIOR FILING DATE: 1999-02-10 
PRIOR APPLICATION NUMBER: 60/139,172 
PRIOR FILING DATE: 1999-06-15 
NUMBER OF SEQ ID NOS : 104 

SOFTWARE: FastSEQ for Windows Version 4.0 
SEQ ID NO 53 
LENGTH: 3 0 
TYPE: PRT 

ORGANISM: Artificial Sequence 
FEATURE : 

OTHER INFORMATION: Synthetic oligopeptide substrate 
US- 09-72 4-566A-53 

Query Match 100.0%; Score 8; DB 4; Length 30; 

Best Local Similarity 100.0%; Pred. No. 0.011; 

Matches 8; Conservative 0; Mismatches 0; Indels 

Qy 1 EVNLDAEF 8 

I I I I I I I I 
Db 23 EVNLDAEF 30 



RESULT 17 
US-08-659-984A-16 

; Sequence 16, Application US/08659984A 
; Patent No. 5942400 

GENERAL INFORMATION: 

APPLICANT: Anderson, John P. 
; APPLICANT: Sinha, Sukanto 

; APPLICANT: Jacobson-Croak, Kirs ten L. 

TITLE OF INVENTION: Assays for Detecting Beta-Secretase 
; TITLE OF INVENTION: Inhibition 

; NUMBER OF SEQUENCES: 21 

; CORRESPONDENCE ADDRESS: 

; ADDRESSEE: Townsend and Townsend and Crew LLP 

; STREET: Two Embarcadero Ctr., 8th Floor 

CITY: San Francisco 

STATE: California 

COUNTRY: USA 

ZIP: 94111-3834 
; COMPUTER READABLE FORM: 

MEDIUM TYPE: Floppy disk 

COMPUTER: IBM PC compatible 



; OPERATING SYSTEM: PC-DOS/MS-DOS 

SOFTWARE: Patentln Release #1.0, Version #1.25 
CURRENT APPLICATION DATA: 

APPLICATION NUMBER: US / 0 8 / 65 9 , 9 8 4A 
FILING DATE: 07-JUN-1996 
CLASSIFICATION: 436 
PRIOR APPLICATION DATA: 

APPLICATION NUMBER: US 08/485,152 
FILING DATE: 07-JUN-1995 
ATTORNEY/AGENT INFORMATION : 
; NAME: Heslin, James M. 

REGISTRATION NUMBER: 29,541 
REFERENCE/DOCKET NUMBER: 1527 0-002 8 10US 
TELECOMMUNICATION INFORMATION: 
TELEPHONE: 415-326-2400 
TELEFAX: 415-326-2422 
INFORMATION FOR SEQ ID NO: 16: 
SEQUENCE CHARACTERISTICS: 
; LENGTH: 33 amino acids 

; TYPE: amino acid 

STRANDEDNESS : single 
TOPOLOGY: linear 
; MOLECULE TYPE: peptide 

US-0 8-65 9-9 8 4A-16 



Query Match 100.0%; Score 8; DB 2; Length 33; 

Best Local Similarity 100.0%; Pred. No. 0.012; 

Matches 8; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 



Qy 1 EVNLDAEF 8 

I I I II I I I 
Db 14 EVNLDAEF 21 



RESULT 18 
US-08-660-531-16 

Sequence 16, Application US/08660531 
Patent No. 6221645 
GENERAL INFORMATION: 



APPLICANT 
APPLICANT 
APPLICANT 
APPLICANT 



Chrysler, Susanna M.S. 
Sinha, Sukanto 
Keim, Pamela S. 
Anderson, John P. 
TITLE OF INVENTION: Beta-Secretase 
NUMBER OF SEQUENCES: 21 
CORRESPONDENCE ADDRESS: 

ADDRESSEE: Townsend and Townsend and Crew LLP 
STREET: Two Embarcadero Ctr . , 8th Floor 
CITY: San Francisco 
STATE: California 
COUNTRY: USA 
ZIP: 94111-3834 
COMPUTER READABLE FORM: 

MEDIUM TYPE: Floppy disk 
COMPUTER: IBM PC compatible 
OPERATING SYSTEM: PC-DOS/MS-DOS 

SOFTWARE: Patentln Release #1.0, Version #1.25 



CURRENT APPLICATION DATA: 
; APPLICATION NUMBER: US/ 0 8/ 660 , 531 

FILING DATE: 
CLASSIFICATION: 435 
PRIOR APPLICATION DATA: 
; APPLICATION NUMBER: US 08/480,498 

FILING DATE: 07-JUN-1995 
ATTORNEY/ AGENT INFORMATION: 
; NAME: Heslin, James M. 

REGISTRATION NUMBER: 29,541 
; REFERENCE/ DOCKET NUMBER: 15270-002210US 

TELECOMMUNICATION INFORMATION: 
; TELEPHONE: 415-326-2400 

TELEFAX : 415-326-2422 
INFORMATION FOR SEQ ID NO: 16: 
; SEQUENCE CHARACTERISTICS: 

; LENGTH: 3 3 amino acids 

TYPE: amino acid 
STRANDEDNESS : single 
TOPOLOGY: linear 
; MOLECULE TYPE: peptide 

US-08-660-531-16 

Query Match 100.0%; Score 8; DB 3; Length 33; 

Best Local Similarity 100.0%; Pred. No. 0.012; 

Matches 8; Conservative 0; Mismatches 0; Indels 

Qy 1 EVNLDAEF 8 

I I I I i I I I 
Db 14 EVNLDAEF 21 



RESULT 19 

US- 0 9-724-5 66A-63 

; Sequence 63, Application US/09724566A 
; Patent No. 6627739 
; GENERAL INFORMATION: 



APPLICANT 


: Anderson, John P. 


APPLICANT: 


: Basi, Guriqbal 


APPLICANT: 


: Doane, Minh Tarn 


APPLICANT: 


: Frigon, No. 6627739mand 


APPLICANT : 


: John, Varghese 


APPLICANT: 


: Power, Michael 


APPLICANT: 


: Sinha, Sukanto 


APPLICANT: 


: Tatsuno, Gwen 


APPLICANT : 


: Tung, Jay 


APPLICANT: 


Wang, Shuwen 


APPLICANT : 


McConlogue, Lisa 



TITLE OF INVENTION: Beta- Secretas e Enzyme Compositions and 
TITLE OF INVENTION: Methods 
FILE REFERENCE: 228-US-NEWC2 

CURRENT APPLICATION NUMBER: US/ 09/72 4, 5 6 6A 
CURRENT FILING DATE: 2000-11-28 
PRIOR APPLICATION NUMBER: US 09/501,708 
PRIOR FILING DATE: 2000-02-10 
PRIOR APPLICATION NUMBER: 60/119,571 
; PRIOR FILING DATE: 1999-02-10 



PRIOR APPLICATION NUMBER: 60/139,172 
PRIOR FILING DATE: 1999-06-15 
NUMBER OF SEQ ID NOS : 104 

SOFTWARE: FastSEQ for Windows Version 4.0 
; SEQ ID NO 63 

LENGTH: 33 

TYPE : PRT 
; ORGANISM: Homo sapiens 

FEATURE: 

OTHER INFORMATION: P2 6-P4'sw peptide substrate 
US- 09-72 4-56 6A-63 

Query Match 100.0%; Score 8; DB 4; Length 33; 

Best Local Similarity 100.0%; Pred. No. 0.012; 

Matches 8; Conservative 0; Mismatches 0; Indels 

Qy 1 EVNLDAEF 8 

I II I I I II 
Db 2 6 EVNLDAEF 33 



RESULT 2 0 
US-08-659-9 84A-15 

; Sequence 15, Application US/08659984A 
; Patent No. 5942400 

GENERAL INFORMATION: 
; APPLICANT: Anderson, John P. 

APPLICANT: Sinha, Sukanto 

APPLICANT: Jacobson-Croak, Kirsten L. 

TITLE OF INVENTION: Assays for Detecting Beta-Secretase 
TITLE OF INVENTION: Inhibition 
; NUMBER OF SEQUENCES: 21 

CORRESPONDENCE ADDRESS: 

ADDRESSEE: Townsend and Townsend and Crew LLP 
; STREET: Two Embarcadero Ctr., 8th Floor 

; CITY: San Francisco 

STATE: California 
COUNTRY: USA 
ZIP: 94111-3834 
COMPUTER READABLE FORM: 

MEDIUM TYPE: Floppy disk 
COMPUTER: IBM PC compatible 
; OPERATING SYSTEM: PC-DOS/MS-DOS 

; SOFTWARE: Patentln Release #1.0, Version #1.25 

CURRENT APPLICATION DATA: 
; APPLICATION NUMBER: US/08/659, 984A 

FILING DATE: 07-JUN-1996 

CLASSIFICATION: 436 
PRIOR APPLICATION DATA: 

APPLICATION NUMBER: US 08/485,152 
; FILING DATE: 07-JUN-1995 

; ATTORNEY/AGENT INFORMATION: 

NAME: Heslin, James M. 

REGISTRATION NUMBER: 29,541 

REFERENCE/DOCKET NUMBER: 1 52 7 0- 0 02 8 1 0US 
; TELECOMMUNICATION INFORMATION: 

TELEPHONE: 415-326-2400 



TELEFAX: 415-326-2422 
; INFORMATION FOR SEQ ID NO: 15: 
SEQUENCE CHARACTERISTICS: 
LENGTH: 42 amino acids 
TYPE: amino acid 
STRANDEDNESS : single 
; TOPOLOGY: linear 

MOLECULE TYPE: peptide 
US-0 8-65 9-984A-15 



Query Match 100.0%; Score 8; DB 2; Length 42; 

Best Local Similarity 100.0%; Pred. No. 0.015; 

Matches 8; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 

Qy 1 EVNLDAEF 8 

I I I I I I I I 
Db 23 EVNLDAEF 30 



RESULT 21 
US-08-660-531-15 

Sequence 15, Application US/08660531 
Patent No. 6221645 
GENERAL INFORMATION: 



APPLICANT 
APPLICANT 
APPLICANT 
APPLICANT 



Chrysler, Susanna M.S. 
Sinha, Sukanto 
Keim, Pamela S . 
Anderson, John P. 
TITLE OF INVENTION: Beta-Secretase 
NUMBER OF SEQUENCES: 21 
CORRESPONDENCE ADDRESS: 

ADDRESSEE: Townsend and Townsend and Crew LLP 
STREET: Two Embarcadero Ctr . , 8th Floor 
CITY: San Francisco 
STATE : California 
COUNTRY: USA 
ZIP: 94111-3834 
COMPUTER READABLE FORM: 

MEDIUM TYPE: Floppy disk 
COMPUTER: IBM PC compatible 
OPERATING SYSTEM: PC-DOS /MS-DOS 

SOFTWARE: Patentln Release #1.0, Version #1.25 
CURRENT APPLICATION DATA: 

APPLICATION NUMBER: US / 0 8 / 660 , 5 3 1 

FILING DATE: 

CLASSIFICATION: 435 
PRIOR APPLICATION DATA: 

APPLICATION NUMBER: US 08/480,498 

FILING DATE: 07-JUN-1995 
ATTORNEY/AGENT INFORMATION: 

NAME: Heslin, James M. 

REGISTRATION NUMBER: 29,541 

REFERENCE/DOCKET NUMBER: 1 52 7 0- 0 02 2 1 0US 
TELECOMMUNICATION INFORMATION: 

TELEPHONE : 415-326-2400 

TELEFAX: 415-32 6-2 422 
INFORMATION FOR SEQ ID NO: 15: 



SEQUENCE CHARACTERISTICS: 
LENGTH: 42 amino acids 
TYPE: amino acid 
STRANDEDNESS : single 
TOPOLOGY: linear 
MOLECULE TYPE: peptide 
US-08-660-531-15 

Query Match 100.0%; Score 8; DB 3 ; Length 42; 

Best Local Similarity 100.0%; Pred. No. 0.015; 

Matches 8; Conservative 0; Mismatches 0; Indels 

Qy 1 EVNLDAEF 8 

I I I I I I I I 
Db 23 EVNLDAEF 3 0 



RESULT 22 

US- 0 8-339-7 0 8A- 12 

Sequence 12, Application US/08339708A 
Patent No. 6037521 
GENERAL INFORMATION : 



APPLICANT 
APPLICANT 
APPLICANT 
APPLICANT 
APPLICANT 



Sato, Masahiro 
Takashi, Kobayashi 
Tada, No. 6037521ihiro 
Shoji, Mikio 
Kawarabayashi, Takeshi 
TITLE OF INVENTION: TRANSGENIC ANIMAL MODEL FOR ALZHEIMER'S 
TITLE OF INVENTION: DISEASE 
NUMBER OF SEQUENCES: 23 
CORRESPONDENCE ADDRESS : 

ADDRESSEE: Foley & Lardner 
STREET: 3000 K Street, N.W., Suite 500 
CITY: Washington 
STATE: D.C. 
COUNTRY: USA 
ZIP : 20007-5109 
COMPUTER READABLE FORM: 

MEDIUM TYPE: Floppy disk 
COMPUTER: IBM PC compatible 
OPERATING SYSTEM: PC-DOS/MS-DOS 

SOFTWARE: Patentln Release #1.0, Version #1.30 
CURRENT APPLI CAT I ON DATA : 

APPLICATION NUMBER: US/ 0 8 / 33 9 , 7 0 8A 

FILING DATE: 14-NOV-1994 
PRIOR APPLICATION DATA: 

APPLICATION NUMBER: JP 306026/93 

FILING DATE: 12-NOV-1993 
ATTORNEY/AGENT INFORMATION: 

NAME: SANDERCOCK, COLIN G. 

REGISTRATION NUMBER: 31,298 

REFERENCE/ DOCKET NUMBER: 026083/0159 
TELECOMMUNICATION INFORMATION : 

TELEPHONE: (202) 672-5300 

TELEFAX: (202) 672-5399 

TELEX: 904136 
INFORMATION FOR SEQ ID NO: 12: 



SEQUENCE CHARACTERISTICS: 
; LENGTH: 103 amino acids 

; TYPE: amino acid 

TOPOLOGY: linear 
; MOLECULE TYPE: protein 

US-08-339-708A-12 

Query Match 100.0%; Score 8; DB 3; Length 103; 

Best Local Similarity 100.0%; Pred. No. 0.036; 

Matches 8; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 

Qy 1 EVNLDAEF 8 

I I I I I I I I 
Db 1 EVNLDAEF 8 



RESULT 23 

US-0 8-65 9-9 8 4A-21 

; Sequence 21, Application US/08659984A 
; Patent No. 5942400 
; GENERAL INFORMATION: 
; APPLICANT: Anderson, John P. 

APPLICANT: Sinha, Sukanto 
APPLICANT: Jacobson-Croak, Kirsten L. 
; TITLE OF INVENTION: Assays for Detecting Beta-Secretase 

TITLE OF INVENTION: Inhibition 
NUMBER OF SEQUENCES: 21 
CORRESPONDENCE ADDRESS: 
; ADDRESSEE: Townsend and Townsend and Crew LLP 

; STREET: Two Embarcadero Ctr., 8th Floor 

CITY: San Francisco 
; STATE: California 

; COUNTRY: USA 

ZIP: 94111-3834 
; COMPUTER READABLE FORM: 

MEDIUM TYPE: Floppy disk 
COMPUTER: IBM PC compatible 
OPERATING SYSTEM: PC-DOS/MS-DOS 
; SOFTWARE: Patentln Release #1.0, Version #1.25 

CURRENT APPLICATION DATA: 

APPLICATION NUMBER: US/ 0 8/ 65 9 , 9 8 4 A 
FILING DATE: 07-JUN-1996 
CLASSIFICATION: 436 
PRIOR APPLICATION DATA: 

APPLICATION NUMBER: US 08/485,152 
FILING DATE: 07-JUN-1995 
ATTORNEY/AGENT INFORMATION: 
; NAME: Heslin, James M. 

; REGISTRATION NUMBER: 29,541 

REFERENCE/ DOCKET NUMBER: 15270-002810US 
; TELECOMMUNICATION INFORMATION: 

; TELEPHONE: 415-326-2400 

TELEFAX: 415-326-2422 
INFORMATION FOR SEQ ID NO: 21: 
SEQUENCE CHARACTERISTICS : 
LENGTH: 5 06 amino acids 
; TYPE: amino acid 



; TOPOLOGY: linear 

; MOLECULE TYPE: protein 

US-08-659-98 4A-21 



Query Match 100.0%; Score 8; DB 2; Length 506; 

Best Local Similarity 100.0%; Pred. No. 0.16; 

Matches 8; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 

Qy 1 EVNLDAEF 8 

I I i I I I I I 
Db 404 EVNLDAEF 411 



RESULT 2 4 
US-08-660-531-21 

Sequence 21, Application US/08660531 
Patent No. 6221645 
GENERAL INFORMATION: 



APPLICANT 
APPLICANT 
APPLICANT 
APPLICANT 



Chrysler, Susanna M.S. 
Sinha, Sukanto 
Keim, Pamela S. 
Anderson, John P. 
TITLE OF INVENTION: Beta-Secretase 
NUMBER OF SEQUENCES: 21 
CORRESPONDENCE ADDRESS: 

ADDRESSEE: Townsend and Townsend and Crew LLP 
STREET: Two Embarcadero Ctr., 8th Floor 
CITY: San Francisco 
STATE: California 
COUNTRY: USA 
ZIP: 94111-3834 
COMPUTER READABLE FORM: 

MEDIUM TYPE: Floppy disk 
COMPUTER: IBM PC compatible 
OPERATING SYSTEM: PC-DOS/MS-DOS 

SOFTWARE: Patentln Release #1.0, Version #1.25 
CURRENT APPLICATION DATA: 

APPLICATION NUMBER: US/08/660, 531 
FILING DATE: 
CLASSIFICATION: 435 
PRIOR APPLICATION DATA: 

APPLICATION NUMBER: US 08/480,498 
FILING DATE: 07-JUN-1995 
ATTORNEY/AGENT INFORMATION : 
NAME: Heslin, James M. 
REGISTRATION NUMBER: 29,541 
REFERENCE/DOCKET NUMBER: 152 7 0- 0 02 2 1 0US 
TELECOMMUNICATION INFORMATION: 
TELEPHONE: 415-32 6-2 4 00 
TELEFAX: 415-32 6-2 422 
INFORMATION FOR SEQ ID NO: 21: 
SEQUENCE CHARACTERISTICS: 
LENGTH: 506 amino acids 
TYPE: amino acid 
TOPOLOGY: linear 
MOLECULE TYPE: protein 
US-08-660-531-21 



Query Match 100.0%; Score 8; DB 3; Length 506; 

Best Local Similarity 100.0%; Pred. No. 0.16; 

Matches 8; Conservative 0; Mismatches 0; Indels 

Qy 1 EVNLDAEF 8 

I I I I E I I E 
Db 404 EVNLDAEF 411 



RESULT 2 5 
US-09-054-334-4 

; Sequence 4, Application US/09054334 
; Patent No. 6329163 

GENERAL INFORMATION: 

APPLICANT: Anderson, John P. 
; APPLICANT: Jacobson-Croak, Kirsten L. 

APPLICANT: Sinha, Sukanto 
; TITLE OF INVENTION: Assays for Detecting Beta-Secretase 

; TITLE OF INVENTION: Inhibition 

; NUMBER OF SEQUENCES: 6 

CORRESPONDENCE ADDRESS: 
; ADDRESSEE: Townsend and Townsend and Crew LLP 

; STREET: Two Embarcader Center, Eighth Floor 

; CITY: San Francisco 

; STATE: California 

COUNTRY: USA 
ZIP: 94111-3834 
COMPUTER READABLE FORM: 

MEDIUM TYPE: Floppy disk 
COMPUTER: IBM PC compatible 
OPERATING SYSTEM: PC-DOS/MS-DOS 
; SOFTWARE: Patentln Release #1.0, Version #1.30 

CURRENT APPLICATION DATA: 

APPLICATION NUMBER: US/ 09/ 054 , 334 
; FILING DATE: 02-APR-1998 

CLASSIFICATION: 435 
; PRIOR APPLICATION DATA: 

; APPLICATION NUMBER: US 08/485,152 

FILING DATE: 07-JUN-1995 
ATTORNEY/ AGENT INFORMATION: 
; NAME: Heslin, James M. 

REGISTRATION NUMBER: 29,541 

REFERENCE/ DOCKET NUMBER: 015270-002820US 
; TELECOMMUNICATION INFORMATION: 

TELEPHONE: (415) 576-0200 

TELEFAX: (415) 576-0300 
; INFORMATION FOR SEQ ID NO: 4: 
SEQUENCE CHARACTERISTICS: 

LENGTH: 506 amino acids 
; TYPE: amino acid 

TOPOLOGY: linear 
; MOLECULE TYPE: protein 

US-09-054-334-4 



Query Match 100.0%; Score 8; DB 4; Length 506; 

Best Local Similarity 100.0%; Pred. No. 0.16; 



Matches 



8; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 



Qy 1 EVNLDAEF 8 

I I I I I I ! I 
Db 4 04 EVNLDAEF 411 



RESULT 2 6 

US- 0 9- 54 8- 372 D- 12 

; Sequence 12, Application US/09548372D 
; Patent No. 6420534 
; GENERAL INFORMATION: 

APPLICANT: GURNEY ET AL . 
; TITLE OF INVENTION: ALZHEIMER'S DISEASE SECRETASE, APP SUBSTRATES THEREFOR 
AND USES 

TITLE OF INVENTION: THEREOF 
; FILE REFERENCE: 29915/62801 

CURRENT APPLICATION NUMBER: US / 0 9/ 5 4 8 , 37 2D 

CURRENT FILING DATE: 2000-04-12 

PRIOR APPLICATION NUMBER: US 60/155,493 

PRIOR FILING DATE: 1999-09-23 

PRIOR APPLICATION NUMBER: US 09/404,133 
; PRIOR FILING DATE: 1999-09-23 
; PRIOR APPLICATION NUMBER: PCT/US 9 9/ 2 0 8 8 1 

PRIOR FILING DATE: 1999-09-23 
; PRIOR APPLICATION NUMBER: US 60/101,594 

PRIOR FILING DATE: 1998-09-24 
; NUMBER OF SEQ ID NOS : 73 
; SOFTWARE: Patentln version 3.1 
; SEQ ID NO 12 
LENGTH: 695 
TYPE: PRT 

ORGANISM: Homo sapiens 
US-09-548-372D-12 

Query Match 100.0%; Score 8; DB 4; Length 695; 

Best Local Similarity 100.0%; Pred. No. 0.21; 

Matches 8; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 

Qy 1 EVNLDAEF 8 

I I I I I I I I 
Db 5 93 EVNLDAEF 60 0 



RESULT 27 
US-09-548-367D-12 

; Sequence 12, Application US/09548367D 
; Patent No. 6440698 
; GENERAL INFORMATION: 

APPLICANT: GURNEY ET AL . 
; TITLE OF INVENTION: ALZHEIMER'S DISEASE SECRETASE, APP SUBSTRATES THEREFOR 
AND USES 

; TITLE OF INVENTION: THEREOF 
FILE REFERENCE: 29915/6280H 

CURRENT APPLICATION NUMBER: US/ 0 9/ 5 4 8 , 3 67 D 

CURRENT FILING DATE: 2000-04-12 

PRIOR APPLICATION NUMBER: US 60/155,493 



PRIOR FILING DATE: 1999-09-23 

PRIOR APPLICATION NUMBER: US 09/404,133 

PRIOR FILING DATE: 1999-09-23 
; PRIOR APPLICATION NUMBER: PCT/US9 9/2 0 8 8 1 
; PRIOR FILING DATE: 1999-09-23 
; PRIOR APPLICATION NUMBER: US 60/101,594 
; PRIOR FILING DATE: 1998-09-24 
; NUMBER OF SEQ ID NOS : 73 
; SOFTWARE: Patentln version 3.1 
; SEQ ID NO 12 
LENGTH: 695 
TYPE: PRT 

ORGANISM: Homo sapiens 
US-09-548-367D-12 

Query Match 100.0%; Score 8; DB 4; Length 695; 

Best Local Similarity 100.0%; Pred. No. 0.21; 

Matches 8; Conservative 0; Mismatches 0; Indels 0; Gaps 0 

Qy 1 EVNLDAEF 8 

I I I I I I I I 
Db 593 EVNLDAEF 600 



RESULT 2 8 
US-09-551-853D-12 

; Sequence 12, Application US/09551853D 
; Patent No. 6500667 
; GENERAL INFORMATION: 

APPLICANT : GURNEY ET AL . 
; TITLE OF INVENTION: ALZHEIMER'S DISEASE SECRETASE, APP SUBSTRATES THEREFOR 
AND USES 

TITLE OF INVENTION: THEREOF 

FILE REFERENCE: 29915/6280L 

CURRENT APPLICATION NUMBER: US/09/551, 853D 

CURRENT FILING DATE: 2000-04-18 

PRIOR APPLICATION NUMBER: US 60/155,493 

PRIOR FILING DATE: 1999-09-23 

PRIOR APPLICATION NUMBER: US 09/404,133 

PRIOR FILING DATE: 1999-09-23 

PRIOR APPLICATION NUMBER: PCT/US 9 9 / 2 0 8 8 1 

PRIOR FILING DATE: 1999-09-23 
; PRIOR APPLICATION NUMBER: US 60/101,594 

PRIOR FILING DATE: 1998-09-24 
; NUMBER OF SEQ ID NOS: 73 

SOFTWARE: Patentln version 3.1 
; SEQ ID NO 12 
LENGTH: 695 
TYPE: PRT 
; ORGANISM: Homo sapiens 
US-09-551-8 53D-12 

Query Match 100.0%; Score 8; DB 4; Length 695; 

Best Local Similarity 100.0%; Pred. No. 0.21; 

Matches 8; Conservative 0; Mismatches 0; Indels 0; Gaps 0 



Qy 



1 EVNLDAEF 8 



Db 593 EVNLDAEF 60 0 



RESULT 2 9 
US-09-548-372D-18 

; Sequence 18, Application US/09548372D 

; Patent No. 6420534 

; GENERAL INFORMATION: 

; APPLICANT: GURNEY ET AL . 

TITLE OF INVENTION: ALZHEIMER'S DISEASE SECRETASE, APP SUBSTRATES THEREFOR 
AND USES 

TITLE OF INVENTION: THEREOF 

FILE REFERENCE: 29915/62801 
; CURRENT APPLICATION NUMBER: US/09/54 8 , 372D 
; CURRENT FILING DATE: 2000-04-12 

PRIOR APPLICATION NUMBER: US 60/155,493 

PRIOR FILING DATE: 1999-09-23 

PRIOR APPLICATION NUMBER: US 09/404,133 
; PRIOR FILING DATE: 1999-09-23 
; PRIOR APPLICATION NUMBER: PCT/US99/2088 1 
; PRIOR FILING DATE: 1999-09-23 

PRIOR APPLICATION NUMBER: US 60/101,594 

PRIOR FILING DATE: 1998-09-24 
; NUMBER OF SEQ ID NOS : 73 

SOFTWARE : Patentln version 3,1 
; SEQ ID NO 18 
; LENGTH: 69 7 

TYPE: PRT 
; ORGANISM: Homo sapiens 
US-09-548-372D-18 

Query Match 100.0%; Score 8; DB 4 ; Length 697; 

Best Local Similarity 100.0%; Pred. No. 0.21; 

Matches 8; Conservative 0; Mismatches 0; Indels 0; Gaps 0 

Qy 1 EVNLDAEF 8 

I I I I 1 I I I 
Db 5 93 EVNLDAEF 60 0 



RESULT 3 0 
US-09-548-367D-18 

; Sequence 18, Application US/09548367D 

; Patent No. 6440698 

; GENERAL INFORMATION : 

; APPLICANT: GURNEY ET AL . 

; TITLE OF INVENTION: ALZHEIMER'S DISEASE SECRETASE, APP SUBSTRATES THEREFOR 
AND USES 

TITLE OF INVENTION: THEREOF 

FILE REFERENCE: 29915/6280H 
; CURRENT APPLICATION NUMBER: US/ 0 9 / 5 4 8 , 3 67 D 
; CURRENT FILING DATE: 2000-04-12 
; PRIOR APPLICATION NUMBER: US 60/155,493 

PRIOR FILING DATE: 1999-09-23 

PRIOR APPLICATION NUMBER: US 09/404,133 

PRIOR FILING DATE: 1999-09-23 



PRIOR APPLICATION NUMBER: PCT/US99/20881 

PRIOR FILING DATE: 1999-09-23 

PRIOR APPLICATION NUMBER: US 60/101,594 

PRIOR FILING DATE: 1998-09-24 
; NUMBER OF SEQ ID NOS : 73 
; SOFTWARE: Patentln version 3.1 
; SEQ ID NO 18 
LENGTH: 697 
TYPE: PRT 
; ORGANISM: Homo sapiens 
US-09-54 8-3 67D-18 

Query Match 100.0%; Score 8; DB 4; Length 697; 

Best Local Similarity 100.0%; Pred. No. 0.21; 

Matches 8; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 

Qy 1 EVNLDAEF 8 

I I I II I I I 
Db 593 EVNLDAEF 600 



RESULT 31 
US-09-551-853D-18 

; Sequence 18, Application US/09551853D 

; Patent No. 6500667 

; GENERAL INFORMATION: 

; APPLICANT: GURNEY ET AL . 

; TITLE OF INVENTION: ALZHEIMER'S DISEASE SECRETASE, APP SUBSTRATES THEREFOR 
AND USES 

; TITLE OF INVENTION: THEREOF 
FILE REFERENCE: 29915/6280L 

CURRENT APPLICATION NUMBER: US/ 0 9/ 5 5 1 , 8 53D 

CURRENT FILING DATE: 2000-04-18 

PRIOR APPLICATION NUMBER: US 60/155,493 

PRIOR FILING DATE: 1999-09-23 
; PRIOR APPLICATION NUMBER: US 09/404,133 

PRIOR FILING DATE: 1999-09-23 
; PRIOR APPLICATION NUMBER: PCT/US 99/2 0 8 8 1 

PRIOR FILING DATE: 1999-09-23 
; PRIOR APPLICATION NUMBER: US 60/101,594 

PRIOR FILING DATE: 1998-09-24 
; NUMBER OF SEQ ID NOS: 73 

SOFTWARE: Patentln version 3.1 
; SEQ ID NO 18 

LENGTH: 697 
; TYPE: PRT 

; ORGANISM: Homo sapiens 
US-09-551-853D-18 

Query Match 100.0%; Score 8; DB 4; Length 697; 

Best Local Similarity 100.0%; Pred. No. 0.21; 

Matches 8; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 

Qy 1 EVNLDAEF 8 

I I I I I I I I 
Db 593 EVNLDAEF 600 



RESULT 32 

US-09-2 52-991A-24809 

; Sequence 24809, Application US/09252991A 
; Patent No. 6551795 
; GENERAL INFORMATION: 

; APPLICANT: Marc J. Rubenfield et al . 

; TITLE OF INVENTION: NUCLEIC ACID AND AMINO ACID SEQUENCES RELATING TO 
PSEUDOMONAS 

TITLE OF INVENTION: AERUGINOSA FOR DIAGNOSTICS AND THERAPEUTICS 
FILE REFERENCE: 107196.136 

CURRENT APPLICATION NUMBER: US/0 9/252 , 991A 

CURRENT FILING DATE: 1999-02-18 
; PRIOR APPLICATION NUMBER: US 60/074,788 

PRIOR FILING DATE: 1998-02-18 

PRIOR APPLICATION NUMBER: US 60/094,190 

PRIOR FILING DATE: 1998-07-27 
; NUMBER OF SEQ ID NOS : 33142 
; SEQ ID NO 24809 
LENGTH: 371 
TYPE: PRT 

; ORGANISM: Pseudomonas aeruginosa 
US-09-2 52-991A-248 09 

Query Match 75.0%; Score 6; DB 4 ; Length 371; 

Best Local Similarity 100.0%; Pred. No. 19; 

Matches 6; Conservative 0; Mismatches 0; Indels 0; Gaps 

Qy 1 EVNLDA 6 

HUM 

Db 184 EVNLDA 189 



RESULT 33 
US-08-480-498-2 

Sequence 2, Application US/08480498 
Patent No. 5744346 
GENERAL INFORMATION: 



APPLICANT 
APPLICANT 
APPLICANT 
APPLICANT 



Chrysler, Susanna M.S. 
Sinha, Sukanto 
Keim, Pamela S. 
Anderson, John P. 
TITLE OF INVENTION: Beta-Secretase 
NUMBER OF SEQUENCES: 3 
CORRESPONDENCE ADDRESS: 

ADDRESSEE: Townsend and Townsend Khourie and Crew 
STREET: One Market Plaza, Steuart Tower, Suite 2000 
CITY: San Francisco 
STATE: California 
COUNTRY: USA 
ZIP: 94105 
COMPUTER READABLE FORM: 

MEDIUM TYPE: Floppy disk 
COMPUTER: IBM PC compatible 
OPERATING SYSTEM: PC-DOS/MS-DOS 

SOFTWARE: Patentln Release #1.0, Version #1.25 
CURRENT APPLICATION DATA: 



; APPLICATION NUMBER: US/ 08/ 4 8 0 , 4 98 

; FILING DATE: 

CLASSIFICATION: 435 
ATTORNEY/AGENT INFORMATION: 
; NAME: Heslin, James M. 

REGISTRATION NUMBER: 29,541 
REFERENCE/ DOCKET NUMBER: 01527 0-0 022 00 
; TELECOMMUNICATION INFORMATION: 

TELEPHONE: 415-326-2400 
; TELEFAX: 415-326-2422 

INFORMATION FOR SEQ ID NO: 2: 
SEQUENCE CHARACTERISTICS : 
; LENGTH: 5 amino acids 

; TYPE: amino acid 

STRANDEDNESS : single 
TOPOLOGY: linear 
MOLECULE TYPE: peptide 
US-08-480-498-2 

Query Match 62.5%; Score 5; DB 1; Length 5 ; 

Best Local Similarity 100.0%; Pred. No. 3e+05; 

Matches 5; Conservative 0; Mismatches 0; Indels 

Qy 2 VNLDA 6 

MM! 

Db 1 VNLDA 5 



RESULT 34 
US-08-659-984A-14 

; Sequence 14, Application US/08659984A 
; Patent No. 5942400 

GENERAL INFORMATION: 
; APPLICANT: Anderson, John P. 

; APPLICANT: Sinha, Sukanto 

; APPLICANT: Jacobson-Croak, Kirsten L. 

TITLE OF INVENTION: Assays for Detecting Beta-Secretase 
; TITLE OF INVENTION: Inhibition 

; NUMBER OF SEQUENCES: 21 

CORRESPONDENCE ADDRESS: 

ADDRESSEE: Townsend and Townsend and Crew LLP 

STREET: Two Embarcadero Ctr., 8th Floor 

CITY: San Francisco 
; STATE: California 

COUNTRY: USA 

ZIP: 94111-3834 
COMPUTER READABLE FORM: 

MEDIUM TYPE: Floppy disk 
; COMPUTER: IBM PC compatible 

OPERATING SYSTEM: PC-DOS/MS-DOS 

SOFTWARE: Patentln Release #1.0, Version #1.25 
CURRENT APPLICATION DATA: 

APPLICATION NUMBER: US/ 0 8 / 65 9 , 9 8 4A 

FILING DATE: 07-JUN-1996 

CLASSIFICATION: 436 
PRIOR APPLICATION DATA: 

APPLICATION NUMBER: US 08/485,152 



FILING DATE: 07-JUN-1995 
ATTORNEY/ AGENT INFORMATION: 
; NAME: Heslin, James M. 

REGISTRATION NUMBER: 29,541 
REFERENCE/ DOCKET NUMBER: 1527 0-0 02 8 1 OUS 
TELECOMMUNICATION INFORMATION: 
TELEPHONE: 415-326-2400 
; TELEFAX: 415-326-2422 

; INFORMATION FOR SEQ ID NO: 14: 
SEQUENCE CHARACTERISTICS: 
LENGTH: 5 amino acids 
; TYPE: amino acid 

; STRANDEDNESS : single 

TOPOLOGY: linear 
MOLECULE TYPE: peptide 
US-0 8-65 9-9 8 4A-14 

Query Match 62.5%; Score 5; DB 2 ; Length 5; 

Best Local Similarity 100.0%; Pred. No. 3e+05; 

Matches 5; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 

Qy 2 VNLDA 6 

I I I I I 

Db 1 VNLDA 5 



RESULT 35 
US-08-660-531-14 

; Sequence 14, Application US/08660531 

; Patent No. 6221645 

; GENERAL INFORMATION: 

APPLICANT: Chrysler, Susanna M.S. 

APPLICANT: Sinha, Sukanto 
; APPLICANT: Keim, Pamela S. 

; APPLICANT: Anderson, John P. 

; TITLE OF INVENTION: Beta-Secretase 

NUMBER OF SEQUENCES: 21 
; CORRESPONDENCE ADDRESS: 

; ADDRESSEE: Townsend and Townsend and Crew LLP 

STREET: Two Embarcadero Ctr., 8th Floor 
; CITY: San Francisco 

STATE: California 
COUNTRY: USA 
ZIP: 94111-3834 
COMPUTER READABLE FORM: 

MEDIUM TYPE: Floppy disk 
COMPUTER: IBM PC compatible 
OPERATING SYSTEM: PC-DOS/MS-DOS 
; SOFTWARE: Patentln Release #1.0, Version #1.25 

CURRENT APPLICATION DATA: 

APPLICATION NUMBER: US / 0 8 / 66 0 , 5 3 1 
; FILING DATE: 

; CLASSIFICATION: 435 

PRIOR APPLICATION DATA: 
; APPLICATION NUMBER: US 08/480,498 

; FILING DATE: 07-JUN-1995 

ATTORNEY/ AGENT INFORMATION: 



; NAME: Heslin, James M . 

REGISTRATION NUMBER: 29,541 
REFERENCE/ DOCKET NUMBER: 15270-002210US 
TELECOMMUNICATION INFORMATION: 
TELEPHONE: 415-326-2400 
; TELEFAX: 415-326-2422 

; INFORMATION FOR SEQ ID NO: 14: 

SEQUENCE CHARACTERISTICS: 
; LENGTH: 5 amino acids 

TYPE: amino acid 
STRANDEDNESS : single 
TOPOLOGY: linear 
MOLECULE TYPE: peptide 
US-08-660-531-14 

Query Match 62.5%; Score 5; DB 3 ; Length 5; 

Best Local Similarity 100.0%; Pred. No. 3e+05; 

Matches 5; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 

Qy 2 VNLDA 6 

I I I I E 

Db 1 VNLDA 5 



RESULT 3 6 
US-09-054-334-2 

; Sequence 2, Application US/09054334 
; Patent No. 6329163 

GENERAL INFORMATION: 

APPLICANT: Anderson, John P. 
; APPLICANT: Jacobson-Croak, Kirsten L. 

; APPLICANT: Sinha, Sukanto 

; TITLE OF INVENTION: Assays for Detecting Beta-Secretase 

; TITLE OF INVENTION: Inhibition 

NUMBER OF SEQUENCES: 6 
; CORRESPONDENCE ADDRESS: 

ADDRESSEE: Townsend and Townsend and Crew LLP 

STREET: Two Embarcader Center, Eighth Floor 

CITY: San Francisco 
; STATE: California 

COUNTRY: USA 
; ZIP: 94111-3834 

COMPUTER READABLE FORM: 
; MEDIUM TYPE: Floppy disk 

COMPUTER: IBM PC compatible 
; OPERATING SYSTEM: PC-DOS/MS-DOS 

SOFTWARE: Patentln Release #1.0, Version #1.30 
; CURRENT APPLICATION DATA: 

APPLICATION NUMBER: US/ 09/054 , 334 

FILING DATE: 02-APR-1998 

CLASSIFICATION: 435 
PRIOR APPLICATION DATA: 

APPLICATION NUMBER: US 08/485,152 

FILING DATE: 07-JUN-1995 
ATTORNEY/AGENT INFORMATION: 
; NAME: Heslin, James M. 

REGISTRATION NUMBER: 2 9,541 



REFERENCE/ DOCKET NUMBER: 0152 7 0- 002 82 OUS 
TELECOMMUNICATION INFORMATION: 
TELEPHONE: (415) 576-0200 
TELEFAX: (415) 576-0300 
INFORMATION FOR SEQ ID NO: 2: 
; SEQUENCE CHARACTERISTICS: 

; LENGTH: 5 amino acids 

; TYPE: amino acid 

STRANDEDNESS : 
; TOPOLOGY : linear 

MOLECULE TYPE: peptide 
US-09-054-334-2 

Query Match 62.5%; Score 5; DB 4 ; Length 5; 

Best Local Similarity 100.0%; Pred. No. 3e+05; 

Matches 5; Conservative 0; Mismatches 0; Indels 

Qy 2 VNLDA 6 

Mill 

Db 1 VNLDA 5 



RESULT 37 

US- 0 9-72 4-5 6 6A-51 

; Sequence 51, Application US/09724566A 
; Patent No. 6627739 
; GENERAL INFORMATION: 



APPLICANT : 


: Anderson, John P. 


APPLICANT: 


: Basi, Guriqbal 


APPLICANT : 


: Doane, Minh Tarn 


APPLICANT : 


: Frigon, No. 6627739mand 


APPLICANT : 


: John, Varghese 


APPLICANT: 


: Power, Michael 


APPLICANT: 


: Sinha, Sukanto 


APPLICANT : 


: Tat suno , Gwen 


APPLICANT: 


: Tung, Jay 


APPLICANT : 


: Wang, Shuwen 


APPLICANT : 


: McConlogue , Lisa 



; TITLE OF INVENTION: Beta-Secretase Enzyme Compositions and 
TITLE OF INVENTION: Methods 
FILE REFERENCE: 228-US-NEWC2 

CURRENT APPLICATION NUMBER: US / 0 9 / 7 2 4 , 5 66A 

CURRENT FILING DATE: 2000-11-28 

PRIOR APPLICATION NUMBER: US 09/501,708 

PRIOR FILING DATE: 2000-02-10 

PRIOR APPLICATION NUMBER: 60/119,571 

PRIOR FILING DATE: 1999-02-10 

PRIOR APPLICATION NUMBER: 60/139,172 

PRIOR FILING DATE: 1999-06-15 

NUMBER OF SEQ ID NOS : 104 

SOFTWARE: Fast SEQ for Windows Version 4.0 
; SEQ ID NO 51 
LENGTH : 5 
TYPE: PRT 

ORGANISM: Homo sapiens 
US-09-724-566A-51 



Query Match 62.5%; Score 5; DB 4; Length 5; 

Best Local Similarity 100.0%; Pred. No. 3e+05; 

Matches 5; Conservative 0; Mismatches 0; Indels 



0; Gaps 



0; 



Qy 2 VNLDA 6 

I I I I I 

Db 1 VNLDA 5 



RESULT 3 8 

US- 0 8-13 6-7 4 3B- 3 3 

; Sequence 33, Application US/08136743B 
; Patent No. 5459063 

GENERAL INFORMATION: 
; APPLICANT: Barry S. Cooperman, Harvey Rubin, 

; APPLICANT: Jerome Salem, and Alison L. Fisher 

; TITLE OF INVENTION: "Plasmodium falciparum Ribonu- 

; TITLE OF INVENTION: cleotide Reductase, DNA Sequences Therefor and Peptide 

inhibitor 

TITLE OF INVENTION: Thereof" 

NUMBER OF SEQUENCES: 67 
; CORRESPONDENCE ADDRESS: 

; ADDRESSEE: The University of Pennsylvania 

; STREET: Suite 330 

STREET: 3700 Market Street 

CITY: Philadelphia 
; STATE: Pennsylvania 

COUNTRY: U.S.A. 
; ZIP: 19104-3246 

COMPUTER READABLE FORM: 
; MEDIUM TYPE: Diskette, 3.50 inch, 720 Kb 

COMPUTER: IBM PS/2 

OPERATING SYSTEM: MS-DOS 
; SOFTWARE: WordPerfect 5.1 

; CURRENT APPLICATION DATA: 

APPLICATION NUMBER: US/ 08/136, 743B 

FILING DATE: 10/14/93 
; CLASSIFICATION: 435 

; ATTORNEY/ AGENT INFORMATION: 

; NAME: Monaco, Daniel A. 

REGISTRATION NUMBER: 30,480 

REFERENCE/ DOCKET NUMBER: 3957-10 
; TELECOMMUNICATION INFORMATION: 

TELEPHONE: (215) 568-8383 

TELEFAX: (215) 568-5549 

TELEX: No. 5459063e 
; INFORMATION FOR SEQ ID NO: 33: 

SEQUENCE CHARACTERISTICS: 
; LENGTH: 7 amino acids 

; TYPE: amino acid 

; TOPOLOGY: linear 

US-0 8-136-7 4 3B-33 



Query Match 62.5%; Score 5; DB 1; Length 7; 

Best Local Similarity 100.0%; Pred. No. 3e+05; 

Matches 5; Conservative 0; Mismatches 0; Indels 



0; Gaps 



0; 



Qy 4 LDAEF 8 

I I I I I 

Db 3 LDAEF 7 



RESULT 39 
US-08-136-7 4 3B-35 

; Sequence 35, Application US/08136743B 
; Patent No. 5459063 

GENERAL INFORMATION: 
; APPLICANT: Barry S. Cooperman, Harvey Rubin, 

; APPLICANT: Jerome Salem, and Alison L. Fisher 

; TITLE OF INVENTION: "Plasmodium falciparum Ribonu- 

; TITLE OF INVENTION: cleotide Reductase, DNA Sequences Therefor and Peptide 

inhibitor 

; TITLE OF INVENTION: Thereof" 

NUMBER OF SEQUENCES: 67 
CORRESPONDENCE ADDRESS: 
; ADDRESSEE: The University of Pennsylvania 

; STREET: Suite 330 

; STREET: 3700 Market Street 

; CITY: Philadelphia 

; STATE: Pennsylvania 

COUNTRY: U.S.A. 

ZIP: 19104-3246 
; COMPUTER READABLE FORM: 

MEDIUM TYPE: Diskette, 3.50 inch, 720 Kb 

COMPUTER: IBM PS/2 
; OPERATING SYSTEM: MS-DOS 

; SOFTWARE: WordPerfect 5.1 

CURRENT APPLICATION DATA: 

APPLICATION NUMBER: US / 0 8 / 1 3 6 , 7 4 3B 

FILING DATE: 10/14/93 
; CLASSIFICATION: 435 

; ATTORNEY/AGENT INFORMATION: 

; NAME: Monaco, Daniel A. 

REGISTRATION NUMBER: 30,480 

REFERENCE/ DOCKET NUMBER: 3 957-10 
TELECOMMUNICATION INFORMATION: 
; TELEPHONE: (215) 568-8383 

; TELEFAX: (215) 568-5549 

TELEX: No. 5459063e 
; INFORMATION FOR SEQ ID NO: 35: 
; SEQUENCE CHARACTERISTICS: 

; LENGTH : 7 amino acids 

; TYPE: amino acid 

; TOPOLOGY: linear 

US- 0 8- 136-74 3B- 3 5 

Query Match 62.5%; Score 5; DB 1; Length 7; 

Best Local Similarity 100.0%; Pred. No. 3e+05; 

Matches 5; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 

Qy 4 LDAEF 8 

I I I I I 

Db 3 LDAEF 7 



RESULT 4 0 
US-08-136-743B-56 

Sequence 56, Application US/08136743B 
Patent No. 5459063 
GENERAL INFORMATION: 

APPLICANT: Barry S. Cooperman, Harvey Rubin, 
APPLICANT: Jerome Salem, and Alison L. Fisher 
TITLE OF INVENTION: "Plasmodium falciparum Ribonu- 

TITLE OF INVENTION: cleotide Reductase, DNA Sequences Therefor and Peptide 
nhibitor 

TITLE OF INVENTION: Thereof" 
NUMBER OF SEQUENCES: 67 
CORRESPONDENCE ADDRESS: 

ADDRESSEE: The University of Pennsylvania 
STREET: Suite 330 
STREET: 3700 Market Street 
CITY: Philadelphia 
STATE: Pennsylvania 
COUNTRY: U.S.A. 
ZIP: 19104-3246 
COMPUTER READABLE FORM: 

MEDIUM TYPE: Diskette, 3.50 inch, 720 Kb 
COMPUTER: IBM PS/2 
OPERATING SYSTEM: MS-DOS 
SOFTWARE: WordPerfect 5.1 
CURRENT APPLICATION DATA: 

APPLICATION NUMBER: US / 0 8 / 1 3 6 , 7 4 3B 
FILING DATE: 10/14/93 
CLASSIFICATION: 435 
ATTORNEY/AGENT INFORMATION: 
NAME : Monaco, Daniel A. 
REGISTRATION NUMBER: 30,480 
REFERENCE/DOCKET NUMBER: 3957-10 
TELECOMMUNICATION INFORMATION: 
TELEPHONE: (215) 568-8383 
TELEFAX: (215) 568-5549 
TELEX: No. 5459063e 
INFORMATION FOR SEQ ID NO: 56: 
SEQUENCE CHARACTERISTICS: 
LENGTH: 7 amino acids 
TYPE: amino acid 
TOPOLOGY: linear 
US-08-136-743B-56 

Query Match 62.5%; Score 5; DB 1; Length 7; 

Best Local Similarity 100.0%; Pred. No. 3e+05; 

Matches 5; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 

Qy 4 LDAEF 8 

I I I I I 

Db 3 LDAEF 7 



Search completed: March 26, 2004, 15:36:16 
Job time : 24.5 sees 



GenCore version 5.1.6 
Copyright (c) 1993 - 2004 Compugen Ltd. 



OM protein - protein search, using sw model 

March 26, 2004, 15:30:37 



Run on : 



; Search time 13.5 Seconds 
(without alignments) 
57.002 Million cell updates/sec 



Title : 

Perfect score: 
Sequence : 



US-09-668-314C-71 
8 

1 EVNLDAEF 8 



Scoring table: OLIGO 

Gapop 60.0 , Gapext 60.0 



Searched: 
Word size : 



283366 seqs, 96191526 residues 



0 



283366 



Total number of hits satisfying chosen parameters: 

Minimum DB seq length: 0 

Maximum DB seq length: 2000000000 

Post-processing: Listing first 1000 summaries 

Database : FIR_78:* 

1 : pir 1 : * 

2 : pir2 : * 

3 : pir3 : * 

4 : pir4 : * 

Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 
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ALIGNMENTS 



RESULT 1 
D42725 



nitrile hydratase region 3 r -hypothetical protein P47K - Pseudomonas chlororaphis 
(strain B23) 

C; Species: Pseudomonas chlororaphis 

C;Date: 10-Mar-1994 #sequence_revision 10-Mar-1994 #text_change 20-Jun-2000 
C; Accession : D4272 5 

R;Nishiyama, M. ; Horinouchi, S.; Kobayashi, M. ; Nagasawa, T.; Yamada, H.; Beppu, 
T. 

J. Bacteriol. 173, 2465-2472, 1991 

A;Title: Cloning and characterization of genes responsible for metabolism of 
nitrile compounds from Pseudomonas chlororaphis B23. 
A;Reference number: A42725; MUID : 91 1932 02 ; PMID : 2 0135 68 
A; Accession : D42725 

A; Status : preliminary; not compared with conceptual translation 
A; Molecule type: DNA 
A;Residues: 1-419 <NIS> 

A; Cross-references : GB:D9 0216; NID: g2 168 50; PIDN : BAA142 4 7 . 1; PID: g2 16854 
C ; Super family : conserved hypothetical protein yciC 

Query Match 87.5%; Score 7; DB 2; Length 419; 

Best Local Similarity 100.0%; Pred. No. 2; 

Matches 7; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 

Qy 1 EVNLDAE 7 

I I I I I I I 
Db 4 8 EVNLDAE 54 



RESULT 2 
F83867 

dihydrodipicolinate synthase BH1742 [imported] - Bacillus halodurans (strain C- 
125) 

C; Species: Bacillus halodurans 



C;Date: Ol-Dec-2000 #sequence_revision Ol-Dec-2000 #text_change 15-Jun-2001 
C;Accession: F83867 

R;Takami, H. ; Nakasone, K.; Takaki, Y.; Maeno, G . ; Sasaki, R. ; Masui, N.; Fuji, 
F . ; Hirama, C; Nakamura, Y. ; Ogasawara, N . ; Kuhara, S.; Horikoshi, K. 
Nucleic Acids Res. 28, 4317-4331, 2000 

A; Title: Complete genome sequence of the alkaliphilic bacterium Bacillus 

halodurans and genomic sequence comparison with Bacillus subtilis. 

A;Reference number: A83650; MUID : 2 05 12582 ; PMID : 11 058 132 

A;Accession: F83867 

A; Status : preliminary 

A; Molecule type: DNA 

A; Residues: 1-295 <STO> 

A; Cross-references : GB:AP001513; GB: BA00 0 00 4 ; NID: gl017434 5; PIDN : BAB054 61 . 1; 
GSPDB:GN00137 

A; Experimental source: strain C-125 

C ; Genetics : 

A; Gene: BH1742 

C; Superf amily : dihydrodipicolinate synthase 

Query Match 75.0%; Score 6; DB 2; Length 295; 

Best Local Similarity 100.0%; Pred. No. 17; 

Matches 6; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 

Qy 2 VNLDAE 7 

I I I I I I 

Db 142 VNLDAE 147 



RESULT 3 
A83087 

conserved hypothetical protein PA4457 [imported] - Pseudomonas aeruginosa 
(strain PAOl) 

C; Species: Pseudomonas aeruginosa 

C;Date: 15-Sep-2000 #sequence_revision 15-Sep-2000 #text_change 31-Dec-2000 
C;Accession: A83087 

R; Stover, C.K.; Pham, X.Q.; Erwin, A.L.; Mizoguchi, S.D.; Warrener, P.; Hickey, 
M.J.; Brinkman, F.S.L.; Hufnagle, W.O.; Kowalik, D.J.; Lagrou, M. ; Garber, R.L. 
Goltry, L.; Tolentino, E.; Wes tbrook-Wadman, S.; Yuan, Y.; Brody, L.L.; Coulter 
S.N.; Folger, K.R.; Kas, A.; Larbig, K.; Lim, R.M.; Smith, K.A. ; Spencer, D.H.; 
Wong, G.K.S.; Wu, Z.; Paulsen, I.T.; Reizer, J.; Saier, M.H.; Hancock, R.E.W.; 
Lory, S.; Olson, M.V. 
Nature 406, 959-964, 2000 

A; Title: Complete genome sequence of Pseudomonas aeruginosa PAOl, an 
opportunistic pathogen. 

A; Reference number: A82950; MUID : 2 0437337 ; PMID : 1098 4 043 
A;Accession: A83087 
A; Status : preliminary 
A;Molecule type: DNA 
A; Residues: 1-326 <STO> 

A; Cross -references : GB: AE004 8 60; GB: AE004 091 ; NID : g9950 692 ; PIDN : AAG078 4 5 . 1; 

GSPDB:GN0 0131; PASP: PA44 57 

A; Experimental source: strain PAOl 

C ; Genetics : 

A; Gene: PA4 457 

C; Superf amily : probable ATP-binding protein gutQ; CBS homology 



Query Match 



75.0%; Score 6; DB 2; Length 326; 



Best Local Similarity 100.0%; Pred. No. 19; 

Matches 6; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 

Qy 1 EVNLDA 6 

I I I 1 I I 

Db 139 EVNLDA 14 4 



RESULT 4 
A28825 

keratin, type I nonepidermal - African clawed frog 
C; Species: Xenopus laevis (African clawed frog) 

C;Date: 30-Jun-1989 #sequence_revision 30-Jun-1989 #text_change 13-Aug-1999 
C;Accession : A2 8 825 

R; LaFlamme , S.E.; Jamrich, M. ; Richter, K. ; Sargent, T.D.; Dawid, I.B. 
Genes Dev. 2, 853-862, 1988 

A;Title: Xenopus endo B is a keratin preferentially expressed in the embryonic 
notochord . 

A; Reference number: A28825; MUID : 8 9092 007 ; PMID:2463213 
A; Access ion: A2 8 825 
A; Molecule type : mRNA 
A; Residues: 1-368 <LAF> 

A; Cross-references : GB : Y 0 0 2 3 0 ; NID : g64 8 63 ; PIDN : CAA68 372 . 1; PID: g64 8 64 

C ; Genetics : 

A; Start codon : GGT 

C ; Super family : cytoskeletal keratin 
C; Keywords: coiled coil 

Query Match 75.0%; Score 6; DB 2; Length 368; 

Best Local Similarity 100.0%; Pred. No. 21; 

Matches 6; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 

Qy 2 VNLDAE 7 

IE I I I I 

Db 220 VNLDAE 225 



RESULT 5 
F69888 

alanine racemase homolog yncD - Bacillus subtilis 
C; Species: Bacillus subtilis 

C;Date: 05-Dec-1997 #sequence_revision 05-Dec-1997 #text_change 20-Jun-2000 
C; Access ion : F698 8 8 

R;Kunst, F. ; Ogasawara, N . ; Moszer, I.; Albertini, A.M.; Alloni, G . ; Azevedo, 
V.; Bertero, M.G.; Bessieres, P.; Bolotin, A.; Borchert, S.; Boriss, R. ; 
Boursier, L.; Brans, A.; Braun, M. ; Brignell, S.C.; Bron, S.; Brouillet, S.; 
Bruschi, C.V.; Caldwell, B.; Capuano, V.; Carter, N.M.; Choi, S.K.; Codani, 
J.J.; Connerton, I.F.; Cummings, N.J.; Daniel, R.A. ; Denizot, F. ; Devine, K.M.; 
Duesterhoef t , A.; Ehrlich, S.D.; Emmerson, P.T.; Entian, K.D.; Errington, J.; 
Fabret, C; Ferrari, E. 
Nature 390, 249-256, 1997 

A; Authors: Foulger, D.; Fritz, C; Fujita, M. ; Fujita, Y. ; Fuma, S.; Galizzi, 
A.; Galleron, N.; Ghim, S.Y.; Glaser, P.; Goffeau, A.; Golightly, E.J.; Grandi, 
G.; Guiseppi, G . ; Guy, B.J.; Haga, K.; Haiech, J.; Harwood, C.R.; Henaut, A.; 
Hilbert, H.; Holsappel, S.; Hosono, S.; Hullo, M.F.; Itaya, M. ; Jones, L.; 
Joris, B.; Karamata, D.; Kasahara, Y . ; Klaerr-Blanchard, M. ; Klein, C; 



Kobayashi, Y. ; Koetter, P.; Koningstein, G . ; Krogh, S.; Kumano, M. ; Kurita, K.; 
Lapidus, A.; Lardinois, S. 

A;Authors : Lauber, J.; Lazarevic, V.; Lee, S.M.; Levine, A.; Liu, H.; Masuda, 
S.; Maueel, C; Medigue, C; Medina, N.; Mellado, R.P.; Mi zuno, M. ; Moestl, D. ; 
Nakai, S.; Noback, M. ; Noone, D.; O'Reilly, M. ; Ogawa, K. ; Ogiwara, A.; Oudega, 
B.; Park, S.H.; Parro, V.; Pohl, T.M. ; Portetelle, D.; Porwolik, S.; Prescott, 
A.M.; Presecan, E . ; Pujic, P.; Purnelle, B.; Rapoport, G . ; Rey, M. ; Reynolds, 
S.; Rieger, M. ; Rivolta, C.; Rocha, E . ; Roche, B.; Rose, M. ; Sadaie, Y. ; Sato, 
T . ; Scanlon, E . 

A; Authors: Schleich, S.; Schroeter, R. ; Scoff one, F. ; Sekiguchi, J.; Sekowska, 
A.; Seror, S.J.; Serror, P.; Shin, B.S.; Soldo, B.; Sorokin, A.; Tacconi, E . ; 
Takagi, T.; Takahashi, H.; Takemaru, K.; Takeuchi, M. ; Tamakoshi, A.; Tanaka, 
T.; Terpstra, P.; Tognoni, A.; Tosato, V.; Uchiyama, S.; Vandenbol, M. ; Vannier, 
F. ; Vassarotti, A.; Viari, A.; Wambutt, R. ; Wedler, E . ; Wedler, H.; 
Weitzenegger, T.; Winters, P.; Wipat, A.; Yamamoto, H.; Yamane, K. ; Yasumoto, 
K.; Yata, K. ; Yoshida, K. 

A; Authors: Yoshikawa, H.F.; Zumstein, E.; Yoshikawa, H.; Danchin, A. 

A; Title: The complete genome sequence of the Gram-positive bacterium Bacillus 

subtilis . 

A; Reference number: A69580; MUID : 98044033 ; PMID: 9384377 
A; Accession: F698 8 8 

A; Status : preliminary; nucleic acid sequence not shown; translation not shown 
A;Molecule type: DNA 
A; Residues: 1-394 <KUN> 

A; Cross -references : GB: Z 99113; GB:AL009126; NID : g2 634 090 ; PIDN : CAB1364 8 . 1; 
PlD:g2634148 

A; Experimental source: strain 168 
C ; Genetics : 
A; Gene: yncD 

C; Superf amily : alanine racemase 

Query Match 75.0%; Score 6; DB 2; Length 394; 

Best Local Similarity 100.0%; Pred. No. 22; 

Matches 6; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 

Qy 1 EVNLDA 6 

I I I I I I 

Db 11 EVNLDA 16 



RESULT 6 
JH0776 

hydrogenase (EC 1.18.99.1) large chain - Alcaligenes hydrogenophilus 
C; Species: Alcaligenes hydrogenophilus 

C;Date: 10-Sep-1999 #sequence_revision 10-Sep-1999 #text_change 10-Sep-1999 
C;Accession: JH0776 

R;Yagi, K. ; Seto, T.; Terakado, M. ; Umeda, F. ; Doi, T . ; Imanishi, T.; Miura, Y. 
Chem. Pharm. Bull. 40, 3292-3296, 1992 

A;Title: Nucleotide sequences of membrane-bound hydrogenase gene in Alcaligenes 
hydrogenophilus . 

A;Reference number: JH0775; MUID: 93193199; PMID: 1294332 
A;Accession: JH0776 
A; Molecule type: DNA 
A; Residues: 1-619 <YAG> 

A; Cross-references : GB : S 5 6 8 9 8 ; NID: g29 9291; PIDN :AAB2 57 8 0 . 1; PID: g299293 
C ; Genetics : 
A; Gene : hupL 



C; Function : 

A; Pathway: hydrogen metabolism 

A;Note: contains iron-sulfur and nickel 

C; Superf amily : hydrogenase (NiFe) large chain 

C; Keywords: hydrogen metabolism; iron; iron-sulfur protein; membrane bound; 
metalloprotein; nickel; oxidoreductas e 

F;75,78, 6 03 /Binding site: nickel (Cys) #status predicted 
F; 78 , 60 3 /Binding site: iron (Cys) ^status predicted 
F;82/Active site: His #status predicted 

Query Match 75.0%; Score 6; DB 1; Length 619; 

Best Local Similarity 100.0%; Pred. No. 33; 

Matches 6; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 

Qy 1 EVNLDA 6 

Mill! 

Db 33 EVNLDA 38 



RESULT 7 
C90326 

hypothetical protein SS01653 [imported] - Sulfolobus solfataricus 
C; Species: Sulfolobus solfataricus 

C;Date: 24-May-2001 #sequence_revision 24-May-2001 #text_change 24-May-2001 
C; Accession: C9032 6 

R;She, Q. ; Singh, R.K.; Conf alonieri , F. ; Zivanovic, Y.; Allard, G.; Awayez, 
M.J.; Chan-Weiher, C.C.Y.; Clausen, I.G.; Curtis, B.A. ; De Moors, A.; Erauso, 
G.; Fletcher, C; Gordon, P.M.K.; Heikamp-de Jong, I.; Jeffries, A.C.; Kozera, 
C.J.; Medina, N.; Peng, X.; Thi-Ngoc, H.P.; Redder, P.; Schenk, M.E.; Theriault, 
C; Tolstrup, N.; Charlebois, R.L.; Doolittle, W.F.; Duguet, M. ; Gaasterland, 
T.; Garrett, R.A. ; Ragan, M.A. ; Sensen, C.W.; Van der Oost, J. 
submitted to GenBank, April 2001 

A; Description : Sulfolobus solfataricus complete genome. 

A; Reference number: A9 913 9 

A; Accession: C9032 6 

A; Status: preliminary 

A; Molecule type: DNA 

A; Residues: 1-8 02 <KUR> 

A;Cross-references : GB:AE006641; NID : gl38 14 900 ; PIDN : AAK4 18 66 . 1 ; GSPDB : GN00155 
C ; Genetics : 
A;Gene: SS01653 

Query Match 75.0%; Score 6; DB 2; Length 802; 

Best Local Similarity 100.0%; Pred. No. 42; 

Matches 6; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 

Qy 1 EVNLDA 6 

MINI 

Db 2 92 EVNLDA 2 97 



RESULT 8 
G82193 

aminopeptidase N VC1494 [imported] - Vibrio cholerae {strain N16961 serogroup 
Ol) 

C; Species: Vibrio cholerae 

C;Date: 18~Aug-2000 #sequence_revision 20-Aug-2000 #text_change 02-Feb-2001 



C; Access ion: G82193 

R; Heidelberg, J.F.; Eisen f J. A. ; Nelson, W.C.; Clayton, R.A. ; Gwinn, M.L.; 
Dodson, R.J.; Haft, D.H.; Hickey, E.K.; Peterson, J.D.; Umayam, L.A. ; Gill, 
S.R. ; Nelson, K.E.; Read, T.D.; Tettelin, H.; Richardson, D.; Ermolaeva, M. 
Vamathevan, J.; Bass, S.; Qin, H.; Dragoi, I.; Sellers, P.; McDonald, L.; 
Utterback, T.; Fleishmann, R.D.; Nierman, W.C.; White, O . ; Salzberg, S.L.; 
Smith, H.O.; Colwell, R.R.; Mekalanos, J. J.; Venter, J.C.; Fraser, CM. 
Nature 406, 477-483, 2000 

A;Title: DNA Sequence of both chromosomes of the cholera pathogen Vibrio 
cholerae . 

A;Reference number: A82035; MUID : 2 04 0 68 33 ; PMID : 10952 3 01 
A; Accession : G82193 
A; Status: preliminary 
A;Molecule type: DNA 
A; Residues: 1-868 <HEI> 

A; Cross-references : GB : AE004227 ; GB : AE0 038 52 ; NID : g9655990 ; PIDN : AAF94 64 9 . 1 
GSPDB: GN0012 6; TIGR: VC14 94 

A; Experimental source: serogroup Ol; strain N16961; biotype El Tor 

C ; Genetics : 

A; Gene: VC14 94 

A; Map position: 1 

C; Superf amily : microsomal aminopeptidase 

Query Match 75.0%; Score 6; DB 2; Length 868; 

Best Local Similarity 100.0%; Pred. No. 45; 

Matches 6; Conservative 0; Mismatches 0; Indels 0; Gaps 

Qy 3 NLDAEF 8 

MINI 

Db 606 NLDAEF 611 



RESULT 9 
A54794 

dynein heavy chain, cytosolic - fruit fly (Drosophila melanogas ter ) 
N; Contains: dynein ATPase (EC 3.6.4.2) 
C; Species: Drosophila melanogaster 

C;Date: 10-Sep-1999 #sequence_revision 10-Sep-1999 #text_change 19-Apr-2002 
C;Accession: A54794 

R; Li , M. ; McGrail, M. ; Serr, M. ; Hays, T.S. 
J. Cell Biol. 126, 1475-1494, 1994 

A;Title: Drosophila cytoplasmic dynein, a microtubule motor that is 
asymmetrically localized in the oocyte. 

A; Reference number: A54794; MUID : 9437 552 4 ; PMID: 8089180 
A;Accession: A54794 
A;Molecule type: mRNA 
A; Residues: 1-4639 <LIA> 

A; Cross-references : GB : L23195; NID : g34 9 668 ; PIDN : AAA60323 .1; PID:g3 49669 
C ; Genetics : 

A; Gene: FlyBase : Dhc64C 

A; Cross-references : FlyBase : FBgn0 0103 4 9 

C; Superf amily : dynein heavy chain, cytosolic 

C; Keywords: ATP; heterotet ramer ; hydrolase; microtubule binding; nucleotide 
binding; P-loop 

F; 1895-1902/ Region : nucleotide-binding motif A (P-loop) 
F; 2210-22 17/Region : nucleotide-binding motif A (P-loop) 
F; 2580-2587/ Region : nucleotide-binding motif A (P-loop) 



F;2922-2929/Region: nucleoticle-binding motif A (P-loop) 
F; 1901/Binding site: ATP (Lys) #status predicted 
F; 2216/Binding site: ATP (Lys) #status predicted 
F; 2 5 8 6/Binding site: ATP (Lys) #status predicted 
F;2928/Binding site: ATP (Lys) #status predicted 



Query Match 75.0%; Score 6; DB 1; Length 4639; 

Best Local Similarity 100.0%; Pred. No. 2e+02; 

Matches 6; Conservative 0; Mismatches 0; Indels 



0; Gaps 



0; 



Qy 

Db 



3 NLDAEF 8 

MINI 
3885 NLDAEF 3890 



RESULT 10 
S40015 

phd protein - phage PI 
C; Species: phage PI 

C;Date: 25-Dec-1994 #sequence__revision 23-Feb-1996 #text_change 08-Oct-1999 
C;Accession: S40015; S38553 

R;Lehnherr, H.; Maguin, E . ; Jafri, S.; Yarmolinsky, M.B. 
J. Mol. Biol. 233, 414-428, 1993 

A; Title: Plasmid addiction genes of bacteriophage PI: doc, which causes cell 
death on curing of prophage, and phd, which prevents host death when prophage is 
retained . 

A; Reference number: S40015; MUID : 94 016561 ; PMID: 8411153 
A;Accession: S40015 
A;Molecule type: DNA 
A; Residues: 1-73 <LEH> 

A; Cross-references : GB : M95666 ; NID : g4 6327 6 ; PIDN : AAA169 32 .1; PID: g2 15645 
R;Schmidt, C; Lehnherr, H.; Guidolin, A.; Arber, W. 
submitted to the EMBL Data Library, November 1992 

A; Description : Additional late promoter sequences of bacteriophage PI. 
A; Reference number: S38553 
A;Accession: S38553 
A; Molecule type: DNA 
A; Residues: 1-73 <SCH> 

A; Cross-references : EMBL:M95666; NID: g4 63276; PIDN : AAA16932 .1; PID: g2 15 64 5 

Query Match 62.5%; Score 5; DB 2; Length 73; 

Best Local Similarity 100.0%; Pred. No. 60; 

Matches 5; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 

Qy 4 LDAEF 8 

I I I I 1 

Db 52 LDAEF 5 6 



RESULT 11 
S57268 

translation elongation factor aEF-1 beta - Sulfolobus solfataricus 
C; Species: Sulfolobus solfataricus 

C;Date: 28-Oct-1995 #sequence_revision 03-Nov-1995 #text_change 08-Oct-1999 
C;Accession: S57268; S65743 

R;Arcari, P.; Raimo, G . ; Ianniciello, G . ; Gallo, M. ; Bocchini, V. 
Biochim. Biophys . Acta 1263, 86-88, 1995 



A;Title: The first nucleotide sequence of an archaeal elongation factor 1— beta 
gene . 

A;Reference number: S57268; MUID : 953592 09 ; PMID:7632739 
A;Accession: S57268 
A; Status: preliminary 
A;Molecule type: DNA 
A; Residues: 1-91 <ARC> 

A; Cross -references : EMBL: X7 67 68 ; NID : g4 9 6732 ; PIDN : CAA54 1 64 .1; PID: g4 96733 
R;Raimo, G. ; Masullo, M. ; Savino, G.; Scarano, G.; Ianniciello, G.; Parente, A. ; 
Bocchini, V. 

Biochim. Biophys . Acta 1293, 106-112, 1996 

A;Title: Archaeal elongation factor 1-beta is a dimer . Primary structure, 
molecular and biochemical properties. 

A;Reference number: S65743; MUID : 9618 62 82; PMID:8652615 
A; Access ion: S 657 43 
A;Molecule type: protein 
A; Residues: 2-91 <RAI> 

C; Superf amily : Sulfolobus solfataricus translation elongation factor aEF-1 beta 
C;Keywords: homodimer 

Query Match 62.5%; Score 5; DB 2; Length 91; 

Best Local Similarity 100.0%; Pred. No. 73; 

Matches 5; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 

Qy 1 EVNLD 5 

MM! 

Db 16 EVNLD 2 0 



RESULT 12 
A75188 

translation elongation factor ef-1, chain beta PAB3009 - Pyrococcus abyssi 
(strain Orsay) 

C; Species: Pyrococcus abyssi 

C;Date: 20-Aug-1999 #sequence_revision 20-Aug-1999 #text_change 20-Aug-1999 

C;Accession: A75188 

R; anonymous, Genoscope 

submitted to the EMBL Data Library, July 1999 

A; Description : Pyrococcus abyssi genome sequence: insights into archaeal 

chromosome structure and evolution. 

A; Reference number: A75001 

A;Accession: A75188 

A; Status : preliminary 

A;Molecule type: DNA 

A; Residues: 1-91 <KAW> 

A; Cross- references : GB: AJ24 8283; GB: AL0 968 36; NID : g54 57433 ; PIDN : CAB4 8 952 . 1; 

PID : el5 14846; PID : g54 574 61 

A; Experimental source: strain Orsay 

C ; Genetics : 

A; Gene: PAB3 00 9 

Query Match 62.5%; Score 5; DB 2; Length 91; 

Best Local Similarity 100.0%; Pred. No. 73; 

Matches 5; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 



Qy 1 EVNLD 5 

M M I 



KW N, N ' -Substituteci-1 , 3-diamino-2-hydroxypropane derivative; 

KW Alzheimer's disease; Down's syndrome; hereditary cerebral haemorrhage; 

KW Dutch-Type amyloidosis; cerebral angiopathy; degenerative dementia; 

KW Parkinson 1 s disease; progressive supranuclear palsy; 

KW cortical basal degeneration; beta-secretase; amyloid precursor protein; 

KW APP; substrate. 

XX 

OS Synthetic. 
XX 

FH Key Location/Qualif ier s 

FT Cleavage-site 5. .6 

FT /note= "Beta-secretase cleavage site" 

XX 

PN W02 0 0304 009 6-A2 . 
XX 

PD 15-MAY-2003. 
XX 

PF 08-NOV-2002; 2 002WO-US036072 . 
XX 

PR 08-NOV-2001; 2 001US- 0337 122 P . 

PR 28-DEC-2001; 2 001US-0344086P . 

PR 03-JAN-2002; 2 002US-0345635P . 
XX 

PA (ELAN-) ELAN PHARM INC. 

PA (PHAA ) PHARMACIA & UPJOHN CO. 

XX 

PI John V, Maillard M, Jagodzinska B, Beck JP, Gailunas A, Fang L; 

PI Sealy J, Tenbrink R, Freskos J, Mickelson J, Samala L, Horn R; 
XX 

DR WPI; 2003-505059/47. 
XX 

PT New N, N ' -substituted-1 , 3-diamino-2-hydroxypropane derivatives are beta 

PT secretase inhibitors used for treating e.g. Alzheimer's disease, Down's 

PT syndrome and hereditary cerebral hemorrhage. 
XX 

PS Example D; SEQ ID NO 8; 1243pp; English. 
XX 

CC The invention comprises N, N ' -Subs tituted- 1 , 3-diamino-2-hydroxypropane 

CC derivatives. The compounds of the invention are useful for treating: 

CC Alzheimer's disease, Down's syndrome, hereditary cerebral haemorrhage 

CC with amyloidosis of the Dutch-Type, cerebral angiopathy, degenerative 

CC dementia, Parkinson's disease, progressive supranuclear palsy and 

CC cortical basal degeneration, and diffuse Lewy body of Alzheimer's 

CC disease. The compounds of the invention also inhibit beta-secretase 

CC mediated cleavage of amyloid precursor protein (APP) . The present amino 

CC acid sequence represents a synthetic oligopeptide substrate that can be 

CC cleaved by beta-secretase. 

XX 

SQ Sequence 9 AA; 

Query Match 100.0%; Score 8; DB 7; Length 9; 

Best Local Similarity 100.0%; Pred. No. 1.4e+06; 

Matches 8; Conservative 0; Mismatches 0; Indels 0; Gaps 

Qy 1 EVNLDAEF 8 

I I I I I I I I 
Db 2 EVNLDAEF 9 



RESULT 36 
ADC26557 

ID ADC26557 standard; peptide; 9 AA. 
XX 

AC ADC26557; 
XX 

DT 18-DEC-2003 (first entry) 
XX 

DE Beta-secretase cleavage site peptide SEQ ID 4. 
XX 

KW nootropic; neuroprotective; Alzheimer's disease; 

KW beta-secretase cleavage site. 

XX 

OS Unidentified. 
XX 

PN WO2003065012-A2 . 
XX 

PD 07-AUG-2003 . 
XX 

PF 03-FEB-2003; 2 003WO-US003236 . 
XX 

PR 01-FEB-2002; 2002US-00066319 . 
XX 

PA (UNMI ) UNIV MICHIGAN. 
XX 

PI Ross BD, Rehemtulla A; 
XX 

DR WPI; 2003-663499/62. 
XX 

PT New chimeric polypeptides and nucleic acids , useful for detecting and 

PT measuring protease activity, for identifying modulators of protease 

PT activity for detecting and for preventing or ameliorating Alzheimer's 

PT disease. 
XX 

PS Claim 15; SEQ ID NO 4; 41pp; English. 

XX 

CC The invention relates to a novel chimeric nucleic acid encoding a 

CC polypeptide which comprises first, second and third domains, where the 

CC first domain comprises a Golgi retention signal peptide or an endoplasmic 

CC reticulum (ER) retention signal peptide, the second domain comprises a 

CC protease cleavage site and the third domain comprises a reporter molecule 

CC and where the protease cleavage site is between the Golgi retention 

CC signal peptide and the reporter molecule. The molecules of the invention 

CC demonstrate nootropic and neuroprotective activities and may be useful 

CC for detecting and measuring protease activity or for identifying 

CC modulators of protease activity in order to detect, prevent or ameliorate 

CC Alzheimer's disease. The current sequence is that of the beta-secretase 

CC cleavage site peptide SEQ ID 4 of the invention. 

XX 

SQ Sequence 9 AA; 

Query Match 100.0%; Score 8; DB 7; Length 9; 

Best Local Similarity 100.0%; Pred. No. 1.4e+06; 

Matches 8; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 



Qy 1 EVNLDAEF 8 

! I I I I I I I 
Db 2 EVNLDAEF 9 



RESULT 37 
ADC10532 

ID ADC10532 standard; peptide; 9 AA. 
XX 

AC ADC10532; 
XX 

DT 18-DEC-2003 (first entry) 
XX 

DE Synthetic beta secretase substrate, SEQ ID NO : 8 . 
XX 

KW Substituted hydroxyethylamine ; beta secretase inhibitor; 

KW amyloid precursor protein; APP; amyloid beta peptide; A beta; 

KW neurodegenerative disorder; Alzheimer's disease; 

KW mild cognitive impairment; MCI; Down's syndrome; 

KW Hereditary Cerebral Haemorrhage with Amyloidosis of the Dutch-type; 

KW HCHWAD; cerebral amyloid angiopathy; degenerative dementia; 

KW Parkinson's disease; progressive supranuclear palsy; 

KW cortical basal degeneration; 

KW diffuse Lewy body type of Alzheimer's disease; lobar haemorrhage; 

KW nootropic; neuroprotective; cerebroprotective ; haemostatic; vasotropic; 

KW synthetic substrate. 

XX 

OS Synthetic. 
XX 

FH Key Location/Qualifiers 

FT Cleavage-site 5. .6 

FT /note= "Beta secretase cleavage site" 

XX 

PN WO2003072535-A2 . 
XX 

PD 04-SEP-2003. 
XX 

PF 27-FEB-2003; 2003WO-US007287 . 
XX 

PR 27-FEB-2002; 2002US-0359953P . 
XX 

PA (ELAN-) ELAN PHARM INC. 

PA (PHAA ) PHARMACIA & UPJOHN. 

XX 

PI Varghese J, Jagodzinska B, Maillard M, Beck JP, Tenbrink RE; 

PI Getman D; 

XX 

DR WPI; 2003-731584/69. 
XX 

PT New substituted hydroxyethylamines , useful for the treatment of 

PT Alzheimer's disease, mild cognitive impairment and dementia, are beta- 

PT secretase, beta-amyloid peptide production inhibitor, and beta-deposition 

PT inhibitors . 

XX 

PS Example D; Page 428; 483pp; English. 
XX 

CC The invention relates to substituted hydroxyethylamines or their salts 



CC which are of use in the treatment of Alzheimer's disease and other 

CC neurodegenerative disorders . The compounds act as beta secretase 

CC inhibitors, preventing the cleavage of amyloid precursor protein (APP) to 

CC form the amyloid beta (A beta) peptide. The amyloid beta peptide is 

CC deposited as plaques, which is thought to play a key role in the 

CC pathogenesis of ALzheimer's disease. The compounds of the invention may 

CC be used for the treatment or prevention of Alzheimer's disease, mild 

CC cognitive impairment (MCI), Down's syndrome, Hereditary Cerebral 

CC Haemorrhage with Amyloidosis of the Dutch-type (HCHWAD) , cerebral amyloid 

CC angiopathy, other degenerative dementias (including dementias of mixed 

CC vascular and degenerative origin, dementia associated with Parkinson's 

CC disease, dementia associated with progressive supranuclear palsy, 

CC dementia associated with cortical basal degeneration) and diffuse Lewy 

CC body type of Alzheimer's disease. The compounds are also useful in 

CC preventing or delaying the onset of Alzheimer's disease in those who 

CC would progress from mild cognitive impairment to Alzheimer's disease, and 

CC for preventing the potential consequences of cerebral amyloid angiopathy 

CC (i.e., single and recurrent lobar haemorrhages). Sequences ADC10529- 

CC ADC10531 represent synthetic substrates which can be used to assay beta- 

CC secretase activity in the presence or absence of test compounds of the 

CC invention . 

XX 

SQ Sequence 9 AA; 

Query Match 100.0%; Score 8; DB 7; Length 9; 

Best Local Similarity 100.0%; Pred. No. 1.4e+06; 

Matches 8; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 

Qy 1 EVNLDAEF 8 

I I I I I I I I 
Db 2 EVNLDAEF 9 



RESULT 3 8 
ADD80769 

ID ADD80769 standard; peptide; 9 AA. 
XX 

AC ADD80769; 
XX 

DT 29-JAN-2004 (first entry) 
XX 

DE Synthetic APP-like peptide #6. 
XX 

KW APP; amyloid precursor protein; Alzheimer's disease; beta-secretase; 

KW nootropic; neuroprotective; cerebroprotecti ve ; hemostatic; 

KW mild cognitive impairment; MCI; Down's syndrome; 

KW hereditary cerebral haemorrhage; Dutch-type amyloidosis; 

KW cerebral amyloid angiopathy; degenerative dementia; diffuse Lewy body; 

KW amyloid beta protein. 

XX 

OS Synthetic. 
XX 

FH Key Location/Qualifiers 

FT Cleavage-site 5 . .6 

FT /note= "Beta secretase cleavage site" 

XX 

PN US 2 003 1 0955 9-Al . 



XX 

PD 12-JUN-2003. 
XX 

PF ll-JUL-2002; 2 002US- 00193 04 4 . 
XX 

PR ll-JUL-2001; 2 0 01US-030452 5P . 

PR 30-JUL-2001; 2 0 01US- 03 0 8 7 5 6P . 

PR 17-DEC-2001; 2 001US- 034 134 lp . 

PR 17-DEC-2001; 2001US-03414 16P . 

PR 21-DEC-2001; 2001US-0344872P . 

PR 14-MAY-2002; 2 002US- 038 0574P . 
XX 

PA (GAIL/) GAILUNAS A. 

PA (TUCK/) TUCKER J A. 

PA (TENB/) TENBRINK R. 

PA (MICK/) MICKELSON J. 
XX 

PI Gailunas A, Tucker JA, Tenbrink R, Mickelson J; 
XX 

DR WPI; 2003-863424/80. 
XX 

PT New N- (3-amino-2-hydroxy-propyl) substituted alkylamide compounds are beta 

PT -secretase activity inhibitors, useful for treating e.g. Alzheimer's 

PT disease, Down's syndrome, mild cognitive impairment, cerebral amyloid 

PT angiopathy. 
XX 

PS Example D; SEQ ID NO 8; 70pp; English. 
XX 

CC The invention relates to N- ( 3— Amino-2-hydroxy-propyl ) substituted 

CC alkylamide compounds (of structure detailed in the specification) or 

CC their salts. Also included is the preparation of the N- ( 3-Amino-2 -hydroxy 

CC -propyl) substituted alkylamide compounds. The compounds are useful in a 

CC medicament for treating, preventing or delaying the onset of Alzheimer's 

CC disease, for treating mild cognitive impairment (MCI), Down's syndrome, a 

CC human having a hereditary cerebral haemorrhage with amyloidosis of the 

CC Dutch-type, cerebral amyloid angiopathy, other degenerative dementias and 

CC diffuse Lewy body type of Alzheimer's disease. The compounds are potent 

CC amyloid beta peptide production inhibitors, hence slow down the 

progression of Alzheimer's disease and prevent the disease in the initial 
phases. The present sequence is a synthetic peptide containing a beta 
secretase cleavage site similar to that of APP (amyloid precursor 

CC protein) , used to test the compounds of the invention. 
XX 

SQ Sequence 9 AA; 

Query Match 100.0%; Score 8; DB 7; Length 9; 
Best Local Similarity 100.0%; Pred. No. 1.4e+06; 

Matches 8; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 

Qy 1 EVNLDAEF 8 

I I I I I I I I 

Db 2 EVNLDAEF 9 



CC 
CC 
CC 



RESULT 39 
AAW08362 

ID AAW08362 standard; peptide; 10 AA. 



XX 

AC AAW08362; 
XX 

DT 05-SEP-1997 (first entry) 
XX 

DE Beta-secretase substrate #3. 
XX 

KW Beta-cleavage site; beta amyloid precursor protein; APP; beta-secretase; 

KW alpha-secretase; proteolytic cleavage; inhibitor; Alzheimer's disease. 
XX 

OS Synthetic. 
XX 

FH Key Location/Qualifiers 

FT Misc-dif f erence 1 

FT /note- "acetylated" 
XX 

PN WO9640885-A2 . 
XX 

PD 19-DEC-1996. 
XX 

PF 07-JUN-1996; 96WO-US009985 . 
XX 

PR 07-JUN-1995; 95US-004 8 04 9 8 . 

PR 07-JUN-1995; 95US-0 04 8 5152 . 
XX 

PA (ATHE-) ATHENA NEUROSCIENCES INC. 
XX 

PI Chrysler SMS , Sinha S, Keim PS, Anderson JP, Jacobson-Croak KL; 

PI Tan H, Mcconlogue LC; 

XX 

DR WPI; 1997-052304/05. 
XX 

PT Beta-secretase which specifically cleaves beta-amyloid precursor protein 

PT - useful to screen for inhibitors useful in treatment of Alzheimer's 

PT disease. 
XX 

PS Disclosure; Page 45; 92pp; English. 
XX 

CC AAW08359-W08362 represent substrates for the enzyme of the invention. The 

CC enzyme of the invention is beta-secretase, and specifically cleaves beta- 

CC amyloid precursor protein (beta-APP) . Normal processing of beta-APP is 

CC thought to occur via cleavage between residues 16 and 17 of the beta- 

CC amyloid peptide region by an alpha-secretase. Pathogenic processing is 

CC thought to occur by beta-secretase cleavage of beta-APP. Beta-secretase 

CC activity can be detected and measured using a method of the invention, 

CC which detects at least one of the beta-secretase cleavage products formed 

CC on cleavage. The method can be used to determine whether a test substance 

CC inhibits proteolytic cleavage, by beta-secretase, of beta-APP. Compounds 

CC effective to at least partially inhibit beta-secretase activity can be 

CC used to inhibit cleavage of beta-APP in cells or mammalian hosts. 

CC Isolation and purification of beta-secretase will permit chemical 

CC modelling of a critical event in the pathology of Alzheimer's disease 
XX 

SQ Sequence 10 AA; 



Query Match 100.0%; Score 8; DB 2; Length 10; 

Best Local Similarity 100.0%; Pred. No. 0.019; 



Matches 8; Conservative 0; Mismatches 0; Indels 0; Gaps 0 



Qy 1 EVNLDAEF 8 

I I I I I I I I 
Db 2 EVNLDAEF 9 



AAY33756; 

09-NOV-1999 (first entry) 
Synthetic oligopeptide 5-5 'SW. 

Beta-secretase; beta-amyloid protein precursor; APP; Down's syndrome; 
Alzheimer's disease; measure activity; cleavage site. 



RESULT 4 0 
AAY33756 

ID AAY33756 standard; protein; 10 AA. 
XX 
AC 
XX 
DT 
XX 
DE 
XX 
KW 
KW 
XX 
OS 
XX 
FH 
FT 
FT 
XX 
PN 
XX 
PD 
XX 
PF 
XX 
PR 
PR 
XX 
PA 
XX 
PI 
XX 
DR 
XX 
PT 
PT 
XX 
PS 
XX 

cc 
cc 
cc 
cc 
cc 
cc 
cc 
cc 
cc 
cc 

XX 



Synthetic . 
Key 

Modif ied-site 



US5942400-A 

24-AUG-1999 

07-JUN-1996 

07-JUN-1995 
07-JUN-1995 



Location/ Quali f iers 
1 

/note= "N-terminal Ser is acetylated" 



96US-00659984 

95US-00480498 
95US-00485152 



(ELAN-) ELAN PHARM INC. 

Sinha S, Jacobson-Croak KL, Anderson JP; 
WPI; 1999-517417/43. 

A method for detecting human beta-secretase cleavage of polypeptides 
useful for identifying beta-secretase inhibitors. 

Example; Col 30; 4 3pp; English. 

Sequences AAY33752 -Y337 56 are synthetic oligopeptides used for measuring 
the activity of beta-secretase (AAY33741) . Beta-secretase is capable of 
cleaving beta-amyloid protein precursor (APP) (AAY33742) . These synthetic 
peptides contain the cleavage site of APP. Beta-secretase and APP are 
used in a method for detecting human beta-secretase cleavage of 
polypeptides and for identifying beta-secretase inhibitors. Inhibition of 
beta-secretase activity would be useful for chemical modelling of a 
critical event in the pathology of Alzheimer's disease. Inhibitors of 
beta-secretase would be useful for the prevention and treatment of 
Alzheimer's disease and Down's Syndrome 



SQ Sequence 10 AA; 



Query Match 100.0%; Score 8; DB 2; Length 10; 

Best Local Similarity 100.0%; Pred. No. 0.019; 

Matches 8; Conservative 0; Mismatches 0; Indels 

Qy 1 EVNLDAEF 8 

I I I I I I I I 
Db 2 EVNLDAEF 9 



Search completed: March 26, 2004, 15:33:30 
Job time : 93.5 sees 



GenCore version 5.1.6 
Copyright (c) 1993 - 2004 Compugen Ltd. 



OM protein - protein search, using sw model 
Run on: March 26, 2004, 15:30:37 ; 



Search time 16.5 Seconds 
(without alignments) 
25.031 Million cell updates/sec 



Title : 

Perfect score 
Sequence : 



US-09-668-314C-71 
8 

1 EVNLDAEF 8 



Scoring table: OLIGO 

Gapop 60.0 , Gapext 60.0 



Searched : 



Word size 



389414 seqs, 51625971 residues 



0 



Total number of hits satisfying chosen parameters 

Minimum DB seq length: 0 

Maximum DB seq length: 2000000000 

Post-processing: Listing first 1000 summaries 



389414 



Database : 



1 
2 
3 
4 
5 
6 



Issued_Patents_AA: * 

: /cgn2_6/ptodata/2/iaa/5A_COMB.pep : * 
: / cgn2_6/ptodata/2/iaa/5B_COMB . pep : * 



/ ^ynz._u/ pLuud uci/ z / idd/ DJx^yKjFiB . pep : 
/ cgn2_6/ptodata/2/iaa/5B_COMB . pep : 
/cgn2_6/ptodata/2/iaa/6A_COMB.pep : 
/cgn2_6/ptodata/2 /iaa/ 6B_C0MB . pep : 
/cgn2_6/ptodata/2/iaa/PCTUS_COMB.pep 
/ cgn2_6/ptodata/2/iaa/backf iles 1 . pep 



Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 

SUMMARIES 



Result 
No. 


Score 


Query 
Match 
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DB 


ID 




Des cription 
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Sequence 


219, App 
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us- 


09-724-566A-52 


Sequence 


52, Appl 
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8 


100. 0 
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Sequence 


83, Appl 
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100. 0 


10 
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us- 
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Sequence 


19, Appl 
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100. 0 


10 
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Sequence 


19, Appl 
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8 


100 . 0 


10 
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09-548-372D-63 


Sequence 


63, Appl 
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100. 0 


10 


4 


us- 


09-548-367D-63 


Sequence 


63, Appl 
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8 


100. 0 


10 


4 


us- 


09-551-853D-63 


Sequence 


63, Appl 
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100. 0 


10 


4 


us- 


09-604-608-5 


Sequence 


5, Appli 


10 


8 


100. 0 


11 


5 


PCT 


-US94-07043A-3 


Sequence 


3, Appli 


11 


8 


100. 0 


21 


2 


us- 


08-659-984A-18 


Sequence 
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. 0 
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2 


US-08-920-162A-1 


Sequence 


1 
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369 
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. 0 
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3 


US-08-461-018A-3 


Sequence 
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Annl i 
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50 . 


. 0 


43 
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Sequence 
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3 
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Sequence 
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Sequence 
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US-09-216-958-3 


Sequence 


3 


Ann 1 i 
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3 


US-09-356-931-1 


Sequence 


1 


Ann 1 "i 


375 
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50 . 
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43 


3 


US-08-733-202-1 


Sequence 


1, 


Appli 


376 
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4 


US-08-703-675C-1 


Sequence 


1, 


Appli 


377 
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43 


4 


US-09-390-692-1 


Sequence 


1, 


Appli 


378 
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50. 
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43 


4 


US-08-617-267C-1 


Sequence 


1, 


Appli 


379 
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50. 
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43 


4 


US-08-617-267C-3 


Sequence 


3, 


Appli 


380 
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50. 
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43 


4 


US-09-303-655-1 


Sequence 


1, 


Appli 


381 
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50 . 
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43 


4 


US-08-294-819-1 


Sequence 


1, 


Appli 


382 


4 


50. 


0 


43 


4 


US-09-408-283-1 


Sequence 


1, 


Appli 


383 
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50. 
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43 


4 


US-09-280-966-1 


Sequence 


1, 


Appli 


384 


4 


50. 
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43 


4 


US-09-032-019-1 


Sequence 


1, 


Appli 


385 
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43 


4 


US-09-481-980A-1 


Sequence 


1, 


Appli 
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43 
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US-09-594-366-1 


Sequence 


1, 


Appli 
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US-08-665-649-1 


Sequence 


1, 


Appli 
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43 
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Sequence 


1, 


Appli 
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Sequence 
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Sequence 


2, 
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Sequence 
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Sequence 
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Sequence 


3, 


Appli 


397 


4 


50 . 


0 


45 


1 


US-08-123-659A-5 


Sequence 


5, 
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Sequence 


5, 
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Sequence 
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Sequence 
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Sequence 
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Sequence 
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Sequence 
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Sequence 
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Sequence 
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Sequence 


4 1 , Appl 


440 


4 


50. 


0 


63 


4 


US-09-419-281-86 


Sequence 


86, Appl 


441 


4 


50. 


0 


63 


4 


US-09-107-532A-5 015 


Sequence 
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Sequence 
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Sequence 
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Sequence 
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Sequence 
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Sequence 
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Sequence 


43, Appl 
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Sequence 


88, Appl 
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0 


65 


4 


US-09-419-281-43 


Sequence 


43, Appl 
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Sequence 
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Sequence 
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Sequence 
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Sequence 
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Sequence 
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Sequence 
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Sequence 
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Sequence 
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Sequence 
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Sequence 
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Sequence 
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Sequence 
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Sequence 


115, App 
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69 
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Sequence 


47, Appl 
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Sequence 


92, Appl 
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Sequence 
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Sequence 
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Sequence 


48, Appl 


476 


4 


50. 


, 0 


70 


3 


us- 


-08- 


-995- 


-156A-93 


Sequence 
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Sequence 


48, Appl 


478 


4 


50. 


. 0 


70 


4 


us- 


-09- 


-419- 


-281-93 


Sequence 
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Sequence 


33, Appl 
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70 
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Sequence 


33, Appl 
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0 
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Sequence 


33, Appl 
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Sequence 


49, Appl 
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71 
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-08- 


-995- 


-156A-50 


Sequence 


50, Appl 
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50. 
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72 
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-995- 
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S ecruence 
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-281-50 


Sequence 


50, Appl 
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95, AppI 
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S ecruence 
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-995- 


-156A-96 


Sequence 
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0 


73 
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-419- 


-281-51 


S ecruence 


51, Appl 
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50. 
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73 
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-281-96 


Sequence 


96, AppI 
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-156A-52 


Sequence 


52, Appl 


499 
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74 


3 


us- 


■08- 


■995- 


-156A-97 


Sequence 


97, Appl 
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50. 
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74 
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■09- 


■419- 


-281-52 


Sequence 


52, Appl 


501 


4 


50. 
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74 
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-281-97 
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97, AppI 
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74 
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■09- 
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Sequence 


4 6, Appl 
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74 
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995- 
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Sequence 
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Sequence 


98, Appl 
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419- 
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Sequence 


53, Appl 
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75 
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09- 


419- 


•281-98 


Sequence 


98, Appl 
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50. 


0 
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08- 


995- 


■156A-54 


Sequence 


54, Appl 
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50. 
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76 


3 
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08- 


995- 
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Sequence 


99, Appl 
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50. 
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76 
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419- 


•281-54 


Sequence 


54, Appl 
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50. 
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76 
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09- 


419- 
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Sequence 


99, Appl 
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Sequence 
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995- 
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Sequence 
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Sequence 


100, App 
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Sequence 


55, Appl 
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Sequence 
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Sequence 
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3704, Ap 


921 


4 


50 . 


0 


165 


4 


us- 


09- 


134- 


0O0C-5015 


Sequence 


5 015, Ap 


922 


4 


50 . 


0 


166 


3 


us- 


08- 


845- 


258-43 


Sequence 


43, Appl 


923 


4 


50. 


0 


166 


3 


us- 


08- 


990- 


571-43 


Sequence 


43, Appl 



